Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed 15 April 2013 (MN 1^1^ style file v2.2) 

Estimating gas masses and dust-to-gas ratios from optical 
spectroscopy 

en . Jarle Brinchmann^'^*, Stephane Chariot^, Guinevere Kauffmann^, 



(N 

Oh 
< 



■ Tim Heckman^, Simon D. M. White^, Christy Tremonti^ 

^Leiden Observatory, Leiden University, P.O. Box 9513, 2300 RA, Leiden, The Netherlands 

'^ Centro de Astrofisica, Universidade do Porto, Rua das Estrelas, 4-150-762 Porto, Portugal 

^UPMC-CNRS, UMR7095, Institut d'Astrophysique de Paris, F-75014, Paris, France 

"^ Max- Planck Institut fiir Astrophysik, 8574-1 Garching, Germany 

^ Johns Hopkins University, Baltimore, Maryland 21218, USA 

^ Department of Astronomy, University of Wisconsin- Madison, 1150 University Ave, Madison, WI 53706, USA 



O 
U 

Oh! 

o 



> 

<N 

en 

o 
en 



% 



15 April 2013 



ABSTRACT 

We present a method to estimate the total gas column density, dust-to-gas and dust-to- 
metal ratios of distant galaxies from rest-frame optical spectra. The technique exploits 
the sensitivity of certain optical lines to changes in depletion of metals onto dust grains 
and uses photo-ionization models to constrain these physical ratios along with the 
metallicity and dust column density. We compare our gas column density estimates 
with H I and CO gas mass estimates in nearby galaxies to show that we recover their 
total gas mass surface density to within a factor of 2 up to a total surface gas mass 
density of ~ 75 M© pc~^. Our technique is independent of the conversion factor of 
CO to H2 and we show that a metallicity dependent Xco is required to achieve good 
agreement between our measurements and that provided by CO and Hi. However we 
also show that our method can not be reliably aperture corrected to total gas mass. 
We calculate dust-to-gas ratios for all star-forming galaxies in the Sloan Digital Sky 
Survey Data Release 7 and show the resulting dependence on metallicity agrees well 
with the trend inferred from modelling of the dust emission of nearby galaxies using 
far-IR data. We also present estimates of the variation of the dust-to-metal ratio with 
metallicity and show that this is poorly constrained at metallicities below 50 per cent 
solar. We conclude with a study of the inventory of gas in the central regions, defined 
both in terms of a fixed physical radius and as a fixed fraction of the half-light radius, 
of ~ 70, 000 star- forming galaxies from the Sloan Digital Sky Survey. We show that 
their central gas content and gas depletion times are not accurately predicted by a 
single parameter, but in agreement with recent studies we find that a combination of 
the stellar mass and some measure of central concentration provides a good predictor 
of gas content in galaxies. We also identify a population of galaxies with low surface 
densities of stars and very long gas depletion times. 

Key words: galaxies - galaxies: evolution - galaxies: individual - galaxies: funda- 
mental parameters - galaxies: ISM 



1 INTRODUCTION 

The baryonic content of a galaxy exists in a range of 
states, from cold molecular gas to diffuse hot gaseous haloes, 
yet for most galaxies our knowledge of the baryon con- 
tent is biased towards the stars. Indeed, the last decade 
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has seen a massive increase in our understanding of the 
stellar co ntent of galaxies oyer a wide ran^e in red- 
shift (e.^. Brinchmann fc Ellis' '2OOO'; 'Dickinson et al.' '20031; 
iBaldrv et al. 2008; Marchesini et al. 2009; Shaplev 2011, for 
a review) . Stellar masses and star formation rates are regu- 
larly obtained from broad-band photometry or optical spec- 
trosc o py for large samples of ga l axies (e.g. Bell fc de Jond 



I2OOII : iKauffmann et al.l l2003bl : iBrinchmann et all l20o3 : 
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ISalim et al."2007'; 'Blanton k Roweidl2007l : IPforr et al.llioTJ : 
[iMar aston et al. 2012), but to fully understand the baryon 
cycle in galaxies we need to also understand the distribu- 
tion of cold gas in galaxies. 

The atomic gas content of galaxies can be traced us- 
ing the 21cm line of Hi, and integrated Hi masses are now 
available for large samp les of galaxies from H i-selected sur- 
yeys such as HIPA SS (|Barnes et al.1 l200lh and ALFALFA 
(JHavnes et al.ll2011) and targ eted surveys such as the GASS 
survey ([Catinella et al.ll2012l l , which has a uniform gas frac- 
tion limit rather than a gas mass limit. Resolved H i maps are 
available for sma her, but still substantial sa mples of galaxies 
such as WHISP (Ivan der Hulst et al.ll200ll), the Ursa Major 
Cluster survey (' TuIIv et a l."l996':'Verheiien fc Sancisill200lh 
and the THINGS survey (Walter et al. 2008:|). 

The cold molecular gas is mostly composed of H2, but 
given its lack of transitions that can be excited at low tem- 
peratures, molecular gas measurements are mostly done us- 
ing CO as a proxy for H2. Single-dish CO surveys have gen- 
erally been targeted (e.g. Braine et al 1993; Boselli 1997) 
but for somewhat inhomogeneous sampl es. A notable excep - 
tion here is the COLDGASS survey diSaintonge et al.ll2Qllal ) 
which obtained CO observations for galaxies from the GASS 
surv ey. As with H i, resolved CO studies have a long history 
fe.g. lYoung fc Scoville 199ll : lYoung et al.1 [19951 ). recent ex- 
amples are The Berkeley Illinois Maryla nd Association Sur - 
vey of Nearby Galaxies (BIMA SONG; iHelfer et al.ll2003l l 
interferometric maps of 44 s piral galaxies , the Nobeyama 
survey of 40 spiral galaxies iKuno et al.l |2007) and the 
HERA CO Line E xtragalactic Survey (HERACLES) by 
iLerov et al.l (|2009bl l which mapped 18 nearby galaxies with 
extensive multi-wavelength data. 

These methods are powerful but they have important 
drawbacks: While H l traces the dominant ingredient in the 
atomic gas, it is very challenging to detect significant num- 
bers of individual galaxies in Hi past z ^0.3 with cur- 
rent facilities ( e.g. Jaffe et al.l I2OI2I : |Verheijen et al.l l20ld : 



ICatinella et al.l l2008V In contrast, CO observations can be 
done out to significant redshifts, but they do not trace the 
dominant component of the molecular gas and a conversion 
factor between the column density of CO and H2 (Xco) 
or CO luminosity and total molecular mass (aco) must be 
used. Both aco and Xco are believed to vary considerabl 



used, t^otn aco and Aco are believed to vary considerably 
with metallicity and ionisation conditions (e.g. Israelii 1997bl : 



iBoselli et aDl2002 ': 'lsrael"2005l: iLerov et al.ll2009al ). although 
a quantitative understan ding of this effect is only now be - 
coming available ("e.g. lLerov et al.ll201ll : lGenzel et al.ll2012l V 
In part due to these constraints, and also because Hi 
and CO surveys are very time-consuming, alternative meth- 
ods have been proposed to infer gas content in galaxies. 
These can be roughly divided into methods that make use of 
scaling relations to infer gas content, and those that measure 
gas content using a more distant tracer than CO. The for- 
mer class has a long history and sprang out of the realisation 
that there is a good correlation between galaxy colour and 
the ratio of H i mass to B-band luminosity (Roberts 1969; 
lBothunlll984l ), or alternatively between galaxy morphology 
and Mj^ JLr (iRoberts &: Havnedl 19941 ). This technique was 
used bv lBell et al.l (|2003l l to infer a baryonic m ass function 
for th e low redshift, z < 0.2, U niverse. Similarly Kannappan 
(I2OO4I . K04) and Zha ng et al.1 (2009, Z09) have provided re- 
lations between colour and Hi gas- fractions which can be 



applied to large samples of galaxies. This has the advan- 
tage of being easy to apply to large sampl es of objects (e.g. 
iKauffmann et al.ll2010l : IZhang et al"1l2012l ). A similar tech- 
nique in this class is to invert the empirically found relation- 
ship between star formation rate and gas surf ace density, the 
Kennicutt-Schmidt relation Ken nicuttI (|l99 8, and references 
therein) . This requires an estimate of the st ar formation rate 
which is then used to infer a gas mass (e.g. lErb et al.ll2006l : 
iTremonti eraDl2004l : IPuech et al.ll201Qh . These relations do 
however rely on local calibration samples and are therefore 
of questionable use for applications at higher redshift. Even 
more importantly they are calibrated to give average trends 
and are not suitable for detecting outliers or peculiar sys- 
tems. We will discuss these issues further in section [531 

The second class of methods aims instead to infer gas 
content via a tracer of gas content. The most widely used 
technique in this class is to make use of dust emission to infer 
gas co ntent, building on the observation bv lBoulanger et al.l 
(| 19851 ) that a comparison of IRAS and H i maps implied that 
dust is a reliable tracer of cold gas. It ha s been used to est i- 
mate gas content in the Milky Way ("e.g. lPame et al.lbOOl l 



and in n earby galaxies by a nurnber of authors (e.^. [ Israeli 
iE, I2OO5I : iLerov et al.ll2009al : iBoouien et alli2013[ 



11997; 



it has been extended to sub-mm observations (James e t al 
2002) and more recently to Herschel data (e.g. Eales e t al 
I2OIOI . I2OI2I . see also Roman-Duval et al 2010). These mod- 



els depend on dust properties, such as the emissivity and 
temperature, and on the typical beam sizes for sub-mm and 
far-IR observations which are considerably larger than the 
resolution obtained in the optical. Modelling the dust emis- 
sion can, however, yield a plethora of additiona l information 
(e.g. 'Prai ne et aT1l2007l : Ida Cunha et al.ll2008l ). Of particu- 
lar interest to this paper is the ability to const rain the dust- 
to-gas ratio of the interstellar medium (e.g. lAniano et al.l 
.2012). 

Here we will discuss a related technique, but instead 
of exploiting the dust emission, we will make use of the 
dust absorption. As we will discuss below this can be ap- 
plied to large samples of galaxies, is relatively insensitive 
to the detailed properties of the dust grains, and like the 
dust emission measure, is sensitive to the total gas content 
and dust-to-gas ratio. At low redshift it therefore offers a 
promising complement to direct detection of gas for char- 
acterising its properties, while at higher redshift it offers 
a simple method to measure the gas content and dust-to- 
gas ratios of large samples of galaxies. As we will outline 
below the method, being based on optical data, generally 
has higher spatial resolution than interferometric Hi and 
CO maps, but it is naturally limited to the stellar disk of 
galaxies since it is an absorption technique. Thus most of 
our results in the final part of the paper will focus on the 
gas content in the central few kpc of galaxies, nicely com- 
plementing the integrated measurements gas measurements 
provided by single-dish observations. 

The plan of the paper is as follows: In sections [2] and [3] 
below we will discuss our method for modelling gas content 
in some detail, including the importance of priors on our 
model parameters. This will result in a model which links 
dust content, metallicity and dust-to- metal ratio to produce 
an estimate of gas column density in a given system. We 
review the data we will use to validate our technique in 
section [4l and emphasise the importance of the chosen at- 
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tenuation curve. Section [5] is devoted to tests of our method 
and we start with a study of how the dust-to-metal and 
dust-to-gas ratios in galaxies depend on redshift, showing 
good agreement with modelling of dust emission in nearby 
galaxies. We then show that our gas column densities agree 
with those inferred for the same regions from H i+CO maps 
in nearby galaxies and exploit the aperture trends inherent 
in Sloan Digital Sky Survey (SDSS) spectroscopic data to 
check that we can recover well-known scaling relations. In 
section [6] we discuss the gas content and gas depletion times 
in the central regions of ^^ 70, 000 galaxies in the local Uni- 
verse, selected from all ^ 200, 000 star-forming galaxies in 
the DR7. We conclude in section [71 Throughout we adopt 
a cosmology with Hq — 70km s~^ Mpc~^, Vim — 0.3 and 
r^A = 0.7 . Where relev ant we adopt a Kroupa initial mass 
function ([KroupalbOOlh . We will denote surface densities by 



2 ESTIMATING GAS MASSES FROM DUST 
ATTENUATION 

From observations it is clear that dust and gas trace each 
other fairly closely, at least when averaged over sufficiently 
large regions. Here we wish to build on this to construct a 
probe of gas content using the dust in absorption as a probe 
of the gas column density and exploiting the temperature de- 
pendence of emission lines to place constraints on the dust- 
to-gas ratio; together they will then provide an estimate of 
the effective gas column density in the region probed by the 
spectrum. We note that our focus here will be on the at- 
tenuation of emission lines only, but if a separate constraint 
on the absorption of the stellar continuum can be had (e.g. 
iKau ffmann et al. 2003b), further constraints on the model 
can in principle be obtained. 

We adopt the simple dust attenuation model by 
ICharlot k Fall (|200Q| . CFOO) and it is therefore convenient to 
distinguish between attenuation in the stellar "birth clouds" 
(i.e. molecular clouds), tbc and in the "ambient" (i.e. dif- 
fuse) ISM, T^s^. 

BC I ISM /-,N 

TV ^ Ty + Ty (1) 

CFOO found a good fit to their sample of galaxies with the 
assumption that the attenuation in the diffuse ISM is typi- 
cally only abo ut 1/3 of the total atten uation affecting young 
stars (see also Ida Cunha et al.l |2008|) , but the actual value 
turns out not to matter for our argument here (but see the 

discussion in section [3?2l) . 

We now follow ICharlot fc Longhettil (|200lL CLOl) and 
introduce a new variable, (f , which is the dust-to-metal ratio 
of the ionised gas. 



? = 



Mz' 



(2) 



where Md is the dust mass and Mz is the mass in metals. 
Alternatively this can be written as 



i = 



E.m.^(X,)(t) 



(3) 



where ^(X^) is the linear depletion of element X^, and rrii 
is the mass of element X^ and where (X^/H)^ is the cosmic, 
ie. undepleted, abundance of element X^. We will return to 



the question of expected values for ^ below. Note that the 
combination ^Z, with Z being the total metallicity, provides 
a measure of the dust-to-gas ratio, and we will return to this 
in section [521 

The surface mass density of gas contributed by the 
birth clouds (assumed spherical) can thereby be derived us- 
ing the definition of ^ = T^d/^z where S^ is the surface 
density of dust and Sz that of metals, and we have made 
the assumption that the effective scale-length of metals and 
dust is the same. In addition the metallicity can be written 
Z = i;z/5]gas, which taken together gives 



^BC 



BC 



Ty md/{(Td^Z), 



(4) 



where rud is the mass of the dust grain and ad its optical 
cross-section — these should be viewed as effective, angle- 
averaged, quantities in the sense adopted by CFOO. 

According to CFOO, the model that produced the bet- 
ter fit to their attenuation curve was on based on a Poisson 
distribution of discrete clouds. Let Ty be the optical depth 
per cloud and n be the mean number of clouds encountered 
along different lines of sight. The observations of starburst 
galaxies favour 0.1 < Ty < 0.5 with a best- fit value of 0.3. 
With these additional assumptions about the optical prop- 
erties and the spatial distribution of dust, we can relate Ty ^ 
to Sgl^. For a Poisson distribution, the effective absorption 
optical depth in the diffuse ISM is (CFOO; Eq 25) simply 



ISM 



■ exp(-rvj 



(5) 



with typically Ty = 0.3. Now, to go from a Poisson distri- 
bution for the probability density of absorption, to a Pois- 
son distribution of gas clouds, we require a new assumption: 
that scattering is mostly forward. This seems to be well sup- 
ported by both mo dels and obser vations of interstellar dust 
in the Milky Way. iDraind (|2003|) compared predictions of 
(cos 0) from models with a range of observations and found 
that {cos 6) > 0.6 in all environments at the optical and UV 
wavelengths of interest to us (see also Gordon (2004) and 
references therein). In this case, the mean surface mass den- 
sity of gas along different lines of sight in the diffuse ISM is 
simply: 



yiSM _ yC 



(6) 



where Sgas is the surface mass density of individual clouds 
(note that, strictly, we have adopted here a Poisson distribu- 
tion of identical, face-on screens rather than one of spherical 
clouds). Substituting n from equation (|5]), we then have 



^ISMvc 



Y^iisivi 



exp(- 



(7) 



The mass density of an individual cloud can be inferred 
as in equation Q above as 



Sgas = TyUld/iad^Z) 



(8) 



We can then substitute this in equation JT)) and expand 
in powers of Ty. For optically thin clouds (recall Ty = 0.3), 
we can write 



^ISM 



r(?"md/(adC^). 



(9) 



It is important to note that this is only an approximate 
formula, which breaks down for more optically thick clouds, 
but for Ty = 0.33, the next term is only 17 per cent of the 
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equation above. With all the assumptions listed above, we 
can then write 



ySC I yISM 



gas 
BC 



(r^ 



+ Tf'^)ma 



c'dS.Z 



c^diZ ' 



(10) 

(11) 
(12) 



For nid and aa corresponding to Galactic- type dust, we 
adopt a mass- weighted mean grain radius oi a — 0.1 /jm 
(using a MRN distribution between 0.005 and 0.25 mi- 
cron), ad ~ Tva^ (since we assumed forward scattering) 
and pd ~ 3g cm""^ (the mean mass density of silicates and 
graphite grains). This yields 



:O.2gM0pc-^ 



(13) 



The numerical pre-factor in equation ()13p is fairly robustly 
determined as it only depends on the mass-weighted ra- 
dius to the first power as well as the reasonably well- 
determined density of carbonaceous and silicate grains. Us- 
ing the Weingar tner fc Drai ne (2001, WDOl) model we find 
that for Rv = 3.1 the pre-factor ranges from 0.17 to 0.23 
across the various C abundances considered by those au- 
thors. For LMC and SMC dust from the WDOl modelhng, 
the pre-factor ranges between 0.21 and 0.26. Thus we con- 
clude that the overall scaling is robustly determined, at least 
across a range of metallicities from the Milky- Way to the 
SMC. We will adopt equation ([13)) in the following. In pass- 
ing we comment that if we have a separate constraint on 
Ega,^ from e.g. the attenuation of the stellar continuum, it 
might be possible to extend this method to provide an es- 
timate of the relative gas content in the ISM and in birth 
clouds and it would also allow constraints to be placed on 
the gas column densities in galaxies without significant on- 
going star formation. 

Despite the overall robustness of the scaling factor to 
the dust grain properties, the derivation above does depend 
on a number of uncertain assumptions so it is essential to 
test the results carefully. In particular it is not clear at what 
density the simplification made in equation Q will break 
down, leading to an underestimate of the gas density. We are 
also sensitive to the exact extinction curve being adopted as 
well as the metallicity estimator used and we will discuss 
this further in section [S] Finally, at very low column densi- 
ties our modelling will break down at some point. Since the 
technique is based on absorption it might be expected to 
be more sensitive than emission-based probes such as CO or 
H I, but we know less about the properties of the attenuation 
curve at low gas densities thus our systematic uncertainties 
could become important here and the method as discussed 
here relies on a background source with emission lines which 
might not be present at very low densities. 

It is worth emphasising that this approach is sensitive to 
the total, ie. atomic plus molecular, gas surface mass density 
in the observed region. This is crucial for testing because it 
implies that using only Hi is insufficient where significant 
contributions to the gas mass are expected to be in molecular 
form. 



3 DERIVATION OF Egas THROUGH FITTING 
TO EMISSION LINES 

We now discuss briefly how we use equation (|13p to con- 
strain Sgas- We need a framework to constrain the metal- 
licity, the dust attenuation and the dust-to-metal ratio of 
the ionised gas. Furthermore we have to take into account 
the fact that these quantities likely will be correlated. To 
a ddress these issues we follo w closely the approach outlined 
in lBrinchmann et al.l (|2004l, hereafter B04) and use a grid of 
the models derived bv lCharlot fc Longhetti (|200lh . We will 
also discuss the choice of priors and a number of uncertain 
ingredients in our modeling and comparisons. For ease of 
reference this is summarised in Table [2] below. 

The CLOl models provide predictions for emission 
line strengths, combining evolving galaxy spectra from the 
iBruzual & Chariot (1993) models with the Cloudy pho- 
toionisation code ([Ferland et al.l ll998l V The models are 
parametrised by five main parameters and are calculated 
on a regular grid in all these parameters (see table 2 in 
B04). For our work here we need four of these parameters: 
the ionisation parameter, U, the dust-to-metal ratio in the 
ionised gas, ^, the dust attenuation in the V band, rv, and 
the metallicity, Z, note that this is the total metallicity in- 
cluding metals depleted onto dust grains and therefore dif- 
fers from the gas-phase metallicity. We then fit our grid of 
CLOl models to optical emission lines; when available we 
use the main optical strong lines [O ii], H/3, [O iii]4959,5007. 
Ha, [Nii]6584 and [S ii]6717,6731, we will discuss their im- 
portance later. Note that with the abundance patterns used 
in the CLOl models, the oxygen abundance of the sun is 
12 + log(0/H)Q = 8.82. 

The ^ parameter, the dust-to-metal ratio of the ionised 
gas, is not commonly used in photo-ionisation calculations 
and warrants a more extensive discussion. It is a linear 
depletion parameter and its average value, which is what 
we are concerned with, in the interstellar medium (ISM) 
of galaxies is in general not known very well from di- 
rect observations. However from the observed near-linear 
corr elation be tween metall icity and the dust-to-gas ratio 
(e.g.llssa et alj 1990: Lisenfel d fc Ferraralll998l : lDraine et al.l 
120071 : Ida Cunha et al. 2010, also see below) it is possible to 
conclude that it shows relatively little variation in nearby 
galaxies. Chemical evolution models do however show that 
^ is expected to show mod e rate evolution with time (e.g. 
lEdmunddlioOll : llnoud l2003l : ICalura et al"]l2008l \ justifying 
our decision to keep it as a free parameter. Note that the 
dust-to-gas ratio, focR = ^Z, is a derived parameter in this 
chosen parametrisation, we will return to this quantity in 
section 15.21 below. 

In the Milky Way the depletion pattern in the ISM 
along certain specific lines of sight is, however, fairly well- 
known. CLOl made slight modifications t o the default deple- 
tion p atter n of Cloudy wh ich is based on lCowie fc Songailal 
(^1986) and Jenkins (^1987). The most recent in-de pth dis- 
cussio n of depletion patterns is the careful study of ljenkind 
((20091) and while there are differences to previous works the 
overall agreement is reasonable — although unfortunately 
the most important elements for our needs, C and O, are af- 
fected by rather significant systematic uncertainties (see the 
discussion in Jenkins 2009). The depletion patterns and 
abundances adopted by CLOl results in an average ^ of 0.3. 
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Figure 1. Top panel: The mean electron temperature as a func- 
tion of metallicity for three different values of ^. Note that as 
more metals are removed from the gas-phase, the mean temper- 
ature goes up because the cooling is reduced. Bottom panel: The 
ratio of the [Oiii]5007 hue to the auroral hue, [Oiii]4363, as a 
function of metallicity. Note that the change in temperature has 
a much larger effect at high metallicity than at low — this limits 
our ability to constrain ^ at low metallicities. The curves in both 
plots were done at fixed ionisation parameter (log U = —3.4) and 
dust attenuation (ry = 0.17). 



This can be contrasted with I Jenkins! (|2009l l whose modelling 
implies a variation in ^ from ^~0. 2 to ^^0.6 along var- 
ious lines of sight in the Milky Way with both a different 
depletion pattern and cosmic abundances than CLOl. For 
reference, our ^ = 0.3 corresponds to F* ~ 0.2 in Jenkins' 
notation. 

Keeping other parameters fixed, the main effect of 
changing ^ is that removing more metals from the gas leads 
to an increase in the electron temperature in the H ii-region. 
We show this explicitly for the CLOl models in Fig.[T]where 
the top panel shows the electron temperature as a function of 
total metallicity for three different values of ^. As expected, 
a larger depletion (larger ^) leads to a higher temperature 
at fixed metallicity. 

The impact of this temperature change is not uniform 
in observable line ratios. The lower panel of the figure illus- 
trates this with the well-known temperature sensitive line 
ratio [O ill] 5007/ [O iii]4363 where we can see that the effect 
of the change in temperature is substantial at high metal- 
licity, but much less noticeable at low metallicities. 

An illustration o f the effect of varying ^ and Z 
in the [Baldwin et aD (|l98lL BPT) [Nii]6584/Ha versus 



[O iii]5007/H/3 diagnostic diagram, frequently used to clas- 
sify galaxies based on their emission l ine p roperties (e.g. 
iKauffmann et al.l l2003al : iKewlev et al.l l2006h can be seen 
in the leftmost panel of Fig. (2] The distribution of emis- 
sion line galaxies in the SDSS DR7 is shown as the un- 
derlying 2D histogram which is coloured according to the 
emission line classification, star- forming (purple), composite 
(light green) and active galactic nuclei (AGN, dark green). 
The middle panel in the figure shows the [S li]6717/Ha ver- 
sus [O iii]5007/H/3 diagnostic diagram with the background 
2D histogram again showing the distribution of SDSS DR7 
galaxies with the colour indicating the mean classification 
based on the first diagram. The final panel shows the ionisa- 
tion parameter sensitive ratio [O ll] 3727/ [O ill] 5007 plotted 
against [N ii]6584/Ha. 

The lines show the model tracks for fixed values of ^, 
increasing from 0.1 to 0.5 from left to right. There are three 
sets of nine lines corresponding to \ogU = —2.8, —3.4, —3.9 
respectively (see the rightmost panel), going from the top 
downwards in all panels. The colouring of the lines corre- 
spond to a change in log Z from 0.1 Zq for black to 4:Zq for 
red. The effect of changing rv is not shown in this figure but 
is minor in the first two panels and should be well known. 

As shown in Fig. [1] increasing ^ increases the tem- 
perature, but it also reduces the amount of oxygen in the 
gas-phase. At high metallicity the resulting increase in the 
electron temperature, Te, is more important than the re- 
duced abundance of oxygen, thus increasing the flux in 
[O III] 5007 relative to H/3, while nitrogen is assumed to be 
non-refractory and the increased Te leads to an increase in 
the [N ii]6584/Ha and [S ii]6717/Ha ratios (see also the dis- 
cussion in CLOl). This effect of ^ on the emission lines is 
the reason why we are able to place some constraints on 
its value from observations. At low metallicity and high Te 
the effect of ^ is weak leading to poor constraints on ^, and 
we will return to this point below and in Appendix 13.11 
In the first two panels the effect of the ionisation param- 
eter is not easily isolated at high metallicity, but the final 
panel shows that the inclusion of [Oil] 3727 helps separate 
the model tracks. This highlights the need to have a num- 
ber of emission lines across the optical range to constrain the 
various model parameters. We also see that the model grid 
cover the emission-line properties of star-forming galaxies in 
the SDSS. 

Given the CLOl model predictions we then adopt 
the Bayesian approac h outlined in B04 (see also 
[Kauffmann et al. 2003b j to derive the probability distribu- 
tion functions (PDFs) for the parameters of interest. Given 
a particular set of lines, {Li}, we use Bayes' theorem to 
calculate the log likelihood of each model, M{U,^,rv, Z), 



lnP(A^|{LO) 



E 



(U-AfMY 



+ lnPr, 



(14) 



where fi is the flux in line i, A is a scaling- factor and Jm 
corresponds to the relevant model. Pr denotes the prior on 
the model parameters and is shown in the top row of Fig. [3l 
see also Appendix 13. 1[ and we have ignored the overall nor- 
malisation factor which is acceptable as long as the cr^'s do 
not depend on the model. 

To calculate ai for the SDSS galaxies we start with the 
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Figure 2. The behaviour of the CLOl models in three diagnostic diagrams. The background histogram shows the distribution of emission 
hue galaxies in the SDSS DR7 and is coloured according to the mean emission line classification following B04 with star-forming (purple), 
composite (light green) and active galactic nuclei (AGN, dark green) classes shown. There are four groups of model tracks for different 
values of log U as indicated in the rightmost panel. The colour changes proportionally to log Z from Z = O.IZq in the top left to Z = 4:Zq 
at the bottom. For each log [/ value, except the highest, there are lines for nine ^ values, linearly spaced from ^ = 0.1 to ^ = 0.5. The 
purple star shows the location of the object used for the illustrations of the fitting process in Figures [3] and |4] and the blue star that 
of Figure [5] The two leftmost panels show two BPT diagrams with slightly different sensitivity to ^, while the rightmost panel plots 
[O ll] 3727/ [O III] 5007 on the y-axis against [Nii]6584/Hq; to better separate the lines for different ionisation parameter. 



Line 


Scaling factor 


[On] 3727 


2.199 


H/3 


1.882 


[Oiii]4959 


1.573 


[O III] 5007 


1.566 


Ha 


2.473 


[Nil] 6584 


2.039 


[Sii]6716,6731 


1.621 



Table 1. Factors to scale formal line fiux errors by derived from 
a comparison of duplicate observations of SDSS galaxies. 



formal line flux errors calculated from the SDSS error spec- 
trum. These errors do not include continuum subtraction 
uncertainties and are therefore likely to underestimate the 
true uncertainties. To adjust for this we make use of dupli- 
cate observations of 33,794 emission line galaxies. For these 
we calculate the difference in measured line flux, normalised 
by the formal uncertainty of this difference. When the errors 
are accurately estimated the distribution of these differences 
should be a unit Gaussian and we use this to find the em- 
pirical scaling factors for the uncertainties that would bring 
the duplicate observations in agreement within the errors. 
These are given in Table [T] For the other data sources we use 
published uncertainties. In both cases we furthermore follow 
B04 and apply an approximate theoretical uncertainty by 
adding 4 per cent of the line flux in quadrature to the un- 
certainties to get the final ai that enters into equation (p^ . 
This theoretical error is an approximate adjustment to take 
into account the fact that each model is characterised by a 
single set of parameters while in reality each region likely 
has a distribution of physical properties (see B04 for more 
discussion) . 

To obtain constraints on a particular quantity we follow 
normal practice and marginalise P onto the parameter in 
question. For the current discussion we are mostly interested 
in Sgas, and the probability in bin i is given by 



P(Egas,0 = ^P(>I|{L.}), 

where TZ is the region defined by 

n= {A4([/,e,rv,2)|log(Egas,i-<5s)< 

logff <log(Sgas,i+5E)}. 



(15) 

(16) 
(17) 



We use a regular grid in log Sgas with bin width Ae = 
0.05 dex between logSgas = — 1 and logEgas — 4 for the 
calculation of Sgas • 

For each PDF we also calculate the entropy 



S="^Pi\0g^P^, 



(18) 



where the sum is over the bins of the PDF. This is a gener- 
alised uncertainty estimate for the PDF and is useful when 
the PDF is multi-peaked, or for determining whether a dis- 
tribution is significantly different from the prior distribu- 
tion. For the latter question we also calculate the Kullback- 
Leibler divergence between the prior, Qi, and the final PDF, 



0?KL 



^P^logJ 



(19) 



Note that c/kl is often defined with natural logarithms 
rather than base- 2 logarithms, but we choose the latter for 
consistency with the entropy definition. 

The process is illustrated in Figures [3] & S] where we 
show how the likelihood distribution of five different quan- 
tities changes as an increasing number of lines is included 
in the summation in equation (|14p . Between one row and 
the next we add the line indicated in the left-hand edge 
to the sum. The first row in the figure corresponds to the 
fit when only one line is included; this obviously provides 
no constraints so returns approximately the priors applied 
to the different quantities. From left to right the columns 
correspond to the total dust attenuation in the F-band, 
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Figure 3. An example of tlie result of the fit to the spectrum given by plateID-MJD-fiberID=736-52221-287. The location of this object 
in the BPT diagram is indicated by the purple square in Figure [2] Each column shows the PDF for one derived parameter as emission 
lines are progressively included to the fit (equation [14}. The leftmost column shows the constraints on the total V-band optical depth, 
Ty. The following three columns show the metallicity, the ionisation parameter, log [/ and the dust-to-metal ratio of the ionised gas, 
^. The second-to-last column shows the log of the conversion factor between Ho; luminosity and star formation rate, ?7hq:5 in units of 
M0 yr"-*^ Lq . The final column shows logEgas in units of Mq pc~^. Given the construction of this plot, the top row essentially shows 
the prior adopted on the quantity in question in the model. In the last column we also show the PDF derived with a fiat prior on Egas 
in blue. See the text for a discussion. 



TV, the metal abundance in units of solar metallicity, log Z, 
the ionisation parameter, log U, the dust-to-metal ratio of 
the ionised gas, ^, the log of the conversion factor between 
Ha-luminosity and star formation rate, r/ua in units of 
Mq yr~^ Lq^ and finally the log of the gas column den- 
sity, Ugas, in units of M© pc~^. 

Focusing first on Fig. [S] in the first column we see the 
well-known fact that to get a good constraint on the dust 
attenuation at least two recombination lines must be avail- 
able (here Ha and H/3). Some (minor) improvement on the 
attenuation estimate is possible if a better constraint on the 
electron temperature, and thence the intrinsic Ha/H/3 Case 
B ratio, can be had through the addition of [Nil] 6584 and 
[S Ii]6717. Fig.|4]shows the same but with the lines added in 
the reverse order which illustrates the usefulness of H/3 and 
[O Ii] — for this particular object [O ii] does not add much 
but H/3 is essential. 

The second column shows the constraints on the metal- 
licity — in this case this is very similar to what is obtained 
for t he oxygen abund a nce, and shows the well-known fact 
(e.g. 'Pagel et al. NlQSOl : lMcGaughl ll99l') that an abundance 
indicator based on [Oil] 3727, [O III] 5007 and H/3 is double- 
valued in some range of parameter space. The addition of 



[Nil] 6584 serves to break this degeneracy and we obtain a 
tight constraint on Z. 

The ionisation parameter is shown in the third column 
and is only well-constrained in this case when a good dust 
correction can be obtained and lines with a range in ion- 
isation potential are included. The dust-to- metal ratio of 
the ionised gas, ^, is only well constrained when lines from 
non-depleted elements, such as N and S, are included in the 
fit. Note that while CLOl assumed that S is not depleted 
in the ISM, it is not entirely cle ar that this is correct (e.g. 
ICalura et al.ll2009l : |jenkinsll2009l , and references therein), it 
is however outside the scope of this paper to study this fur- 
ther. As should be clear from Fig. (2] the quality of the con- 
straint on ^ is a strong function of abundance and at a low 
abundance we are not able to place any constraints on ^ at 
all. The conversion factor between Ha-luminosity and SFR 
was discussed in CLOl and B04 and we refer the readers to 
those articles (note that the similarity between the PDF for 
log?7HcK and logSgas is not a general feature). 

The PDF for Sgas is shown in the last column. The first 
point to notice is that the prior on Sgas is not flat. This is 
a natural consequence of it being a ratio of three quantities 
with flat priors. Thus we show the impact of the prior by 
also showing the PDF if one assumes a flat prior on Sgas 
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Figure 4. Similar to Fig. [S] but tlie emission lines are added in reverse order, providing a clearer illustration of the importance of 
[O ll]3727 and H/3 for the fitting procedure. The effect of applying a broad (blue) and narrow (red) prior on ^ is also indicated here. Since 
this is a relatively metal rich galaxy, the effect is modest on most quantities as long as enough emission lines are present. 



(blue line). This indicates that as long as the PDF is well 
constrained, the impact of the prior is minor. 

This figure is for a "typical" galaxy in the SDSS and 
hence has close to solar metallicity. The figure is representa- 
tive for such galaxies and the main points to note are: One 
of [Nii]6584 or [Sii]6717 should be available in addition to 
H/3, [O III] 5007 and Ha to break the abundance degener- 
acy. Fig. U shows the impact of lines in the reverse order 
and while in the shown case [O ii]3727 does not improve the 
constraints significantly in general it is beneficial to have 
this line as well. 

At lower metallicity we can still constrain the column 
density of dust using the model above, but as we are unable 
to place strong constraints on ^, our Sgas estimates have 
larger error-bars and we are somewhat sensitive to the prior 
placed on ^, thus we turn to look at this next. 



3.1 The impact of priors on ^ on the Sgas 
determinations 

An advantage of the Bayesian approach to parameter esti- 
mation is that it makes it necessary to state prior beliefs 
explicitly. In general the priors adopted will not be impor- 
tant if the evidence outweighs the prior belief. However for 
quantities that are not well constrained by the evidence, 
the derived constraints are usually affected by the priors 
adopted. 

By default the model parameters have flat priors over 



the range covered by the models. Note that this does of 
course imply a preferred median value in the case of no in- 
formation which is set by the mid-point of the range cov- 
ered by the parameters. However as long as most of the 
optical lines are present with good S/N this is unimportant 
and most quantities we are interested in, such as the oxygen 
abundance, 0/H, the star formation rate, SFR, and the dust 
attenuation, ry are unaffected by the choice of prior — at 
least as long as we do not adopt strongly peaked priors on 
the model parameters. 

This is not always the case for ^. This parameter has a 
more subtle effect on emission line fluxes and can only be 
reliably determined at high metallicity where its impact on 
cooling is significant (c.f. Fig.[T]). At low metallicity it is not 
possible to put strong constraints on ^ and its derived value 
is thus sensitive to the prior adopted. 

Since ^Z provides a constraint on the dust-to-gas ratio 
that we need to convert dust column density to gas col- 
umn density, it is natural to ask how sensitive Sgas is to 
the adopted priors. In Fig. [3] we show one example where 
adopting a flat prior on Egas for a metal rich system made 
little difference on the final result, but for a general study it 
is better to focus our attention on ^. 

To explore this dependence we therefore explore three 
different priors for ^. The default prior is flat between ^ = 
0.1 and ^ = 0.5, and then we have two priors taken to be 
Gaussians in ^. We centre both on ^ = 0.3 because, as we 
mentioned above, this is the average value for the SDSS 
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Figure 6. The figure shows the effect on our estimates of Sgas 
for all star-forming galaxies in the SDSS DR7 from changing the 
prior on ^ from fiat to a Gaussian with width a = 0.08 (grey 
shading/dashed line) or to a Gaussian with width cr = 0.15 (green 
shading/solid line). The dashed and solid lines show the median 
trend while the shaded area encloses 68 per cent of the scatter. 



and also in good agreement with theoretical models when 
adju sted to the same abundances as used in the CLOl models 
fe.g. lEdmundsll200ll : [c"alura et al.ll2008h . For the wide prior 
we take a width of cr^ = 0.15 while the narrow prior has 
a^ = 0.08. 

Fig. [5] compares the results of fitting with a flat prior 
on ^ (black line) to that obtained with the narrow prior on ^ 
(red line) and that of a broad prior (blue line). The object is 
a fairly low-metallicity galaxy (12 + logO/H ^ 7.8) so that 
the constraints on ^ are weak, the format is otherwise the 
same as in Fig. [S] It is quite clear from the figure that the 
effect of changing the prior on ^ is negligible for most param- 
eters, but using a peaked prior on ^ does lead to somewhat 
tighter constraints on Sgas- This improvement in the con- 
straints is of course prior dependent, thus the most crucial 
question is whether changing the prior leads to systematic 
differences in the gas mass estimates. 

Fig. \6\ shows that the effect of the prior on ^ on the 
gas mass estimates is modest. The figure shows the differ- 
ence between Sgas estimated using a flat prior and that ob- 
tained using a peaked prior. The dashed line shows the me- 
dian trend for the narrow prior and we plot it against the 
[N ii]6584/Ha ratio as an approximate proxy for metallicity. 
We can see that the difference is typically less that 0.05 dex 
even for the most peaked prior on ^. When viewed as rel- 
ative changes, we flnd that even at the lowest metallicities 
less than 5 per cent of the galaxies show changes in excess 
of 30 per cent. 

It is important to note that the choice of prior also 
has an effect at high metallicity. This is because although 
at high metallicity we can constrain the value of ^, apply- 
ing a peaked prior on ^ will bias its estimate towards the 
peak of the prior, with an effect that is stronger for a more 
peaked prior, as seen in the flgure, because stronger evi- 
dence is then required to overcome the prior belief. Figured 
shows, however, that as long as the emission lines are de- 
tected at sufficiently high signal-to-noise, the effect of a prior 
on ^ is very modest at high metallicity. Thus employing a 
^-prior primarily makes sense in low-metallicity systems, at 



high metallicity there is enough information to constrain ^ 
independently and applying a peaked prior could bias the 
estimate of ^ and Sgas • 

While we do not show it explicitly here, the effect of 
a prior on ^ on other parameters is very small, at least at 
low metallicity. The oxygen abundance, 12 + log 0/H, for 
instance, changes by less than 1 per cent when applying a 
narrow prior on ^. Thus while applying a prior on ^ does 
lead to slight, but systematic, offsets in our Sgas estimates, 
at least these are independent of other parameters to first 
order. We have however, not discussed in detail what an 
appropriate prior should be — the simplest choice is indeed a 
prior that is metallicity independent but as we will see later, 
in section [5Jl it might be better to implement a metallicity 
dependent prior. 



3.2 The importance of the attenuation curve 

Equation (|13p depends on the absolute value of the V-band 
optical depth, which implies a dependence on the attenua- 
tion curve adopted. While the shape of the optical attenu- 
ation curve is more constrained than, say, the UV, it must 
still be treated as an uncertain ingredient in the models. 
For a simple attenuation curve r(A) oc A~^, we have 



a: 



rv = 



H/3 



A- 



■ Inr 



(20) 



where r is the ratio of the Balmer decrement to the Case B 
ratio. Thus a change from n = —0.7 to n = — 1.3 leads to an 
change in log Sgas of —0.28 for the same Balmer ratio and 
corresponding changes to the SFR estimate. When applying 
the fitting methodology described above, the final differences 
are slightly different because of slight changes in the best- 
fit model. Overall however, parameters that do not depend 
directly on rv, such as 0/H, ^, and log [/, are very weakly 
affected by a change in n. 

More complex attenuation laws do not in general lead 
to constant offsets. We therefore have therefore derived es- 
timates of log Sgas for four different attenuation laws as fol- 
lows: 

(i) t(A) oc A"^-^. CFOO observed that a slope of -1.3 
corresponds to the middle range of the optical properties 
of dust grains between the Milky Way, the Large and the 
Small Magellanic Cloud. This was argue d to be appropri - 
ate for attenuat ion of birth clouds in da C unha et al.l (|2008h 
and lWild et al.l (|201lh . We adopt this as our reference law. 

(ii) t(A) oc A~°'^. This is the attenuation law recom- 
mended by CFOO for absorption ultraviolet to infrared con- 
tinua by the ambie nt ISM in sta r- forming galaxies, and used 
in e.g. Brinchmann et al.l (|2004h . However the attenuation of 
emission lines is likely less grey than this. 

(iii) t(A)/t(Av) = (1-m)(A/Av)-'-^+/u(A/Av)-°-^. This 
is the com bination of ( 1) an d (2), as suggested observa- 
tionally by Wild et al. (^2007) and motivated theoretically 
by da Cunha et al. (2008). /i here is the relative contribu- 
tion to the total rv contributed by the ambient ISM. A value 
of /i = 0.3 was suggested by CFOO, but it is also possible to 
fit this keeping /i a free parameter. 

(iv) r{X)/r{Xv) = (1 - /i)(A/Av)"'-^ + /iA-'^^^ with 
n(A) = 2.8/(1 + 3^7l7v) + (0.3 - 0.05/irv)(A - 0.55), with A 
measured in microns. This is the effective attenuation curve 
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Figure 5. Similar to Fig. [S] but tliis time sliowing a low metallicity galaxy (691-52199-0636 at z = 0.038). The results with a flat prior 
on ^ is shown in black, overplotted on this in red we show the results for the narrow prior on ^ and in blue that of a broad prior on ^. 
The effect of the prior is striking for ^ but rather modest for everything else. 

Table 2. Sections where the influence of non-standard priors is explored throughout the paper 



Flat prior on Egas versus the default peaked prior on Egas Figures [3] and [H 

The effect of peaked priors on ^ relative to the default flat prior Sections 13.11 and 15.31 

Results in section [6] are checked 



The impact of changing the dust attenuation laws 

The importance of a variable Xco on the tests of the method 



for sensitivity to the choice of ^ prior. 

Section [321 

All flgures use r(A) oc A"-*^'^ 

The comparisons in section [5.31 use 

Xco from equation (|23|) 



recommended by Chevallard et al (2013, submitted) based 
on an analysis of a range of radiative transfer models. 

These all give broadly speaking the same logSgas es- 
timates, but with systematic shifts between the determina- 
tions. To make this quantitative, we fit all SDSS DR7 star- 
forming galaxies with each of the four different attenuation 
curves. The median effect is such that with option 2 above, 
log Sgas is increased by 0.30 dex in the mean relative to our 
reference curve, while option 3 leads to log Sgas estimates 
higher by 0.19 dex in the mean than the reference values. 
The fourth option produces gas column densities that are 
0.06 dex higher than our reference value. 

In the following we will adopt option 1 above as it is 
a simp le law and it is also very similar to the widely used 
ISeatonI (1979) extinction curve at A > 4200 A. Furthermore 
option 1 gives results that are very similar to that of option 



4 which is the most rigorously justified curve, but which 
depend on one further parameter, /i, which is not well con- 
strained by emission lines alone. 



3.3 The importance of the abundance estimator 

It is well-known that different abundance estimators can give 
sig;nificant ly different oxygen estim ates for the same H ii re- 
gions fsee lKewlev fc Ellisonll2008l . for a comparison). Since 
our estimator depends on the absolute value of the metal- 
licity, we are affected by this uncertainty and it is therefore 
appropriate to revisit this issue for our technique. 

Abundance comparisons are generally made between di- 
rect estimators where auroral lines are used to determine 
electron temperatures appr opriate f or different ionisation 
stages of elements (e.g. Izotov et al.l 12006) and other tech- 
niques. The basic assumption is that the abundances pro- 
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Figure 7. Top panel: The difference in effective Te from our fits 
and the electron temp erature in the [O ill] z one from the direct 
method calculation by iBresolin et alj (|2009l V The agreement is 
satisfactory given that our fits do not include any particularly 
temperature sensitive lines. Bottom panel: The abundance as a 
function of the radius normalised by R25 for NGC 300. The solid 
red ci rcles show the oxygen abundance derived in IBresolin et al.l 
(|2009l ) using the temperature sensitive method. The box plots 
show the oxygen abundances estimated using our model fits, using 
only the [O ii]3727, H/3, [O iii]5007, Ha, and [N 11] 6584 hues in the 
fit. The horizontal line shows the median and the box encloses 50 
per cent of the likelihood while the error bars enclose 95 per cent 
of the likelihood. The blue dashed line is a local regression, or 
loess, smooth of the medians of our estimates. 



vided by the direct method are the most reliable and thus 
provide a reference abundance estimate. 

For our comparison here we wish to extend this compar- 
ison to as high metallicity as possible. To do so, w e took the 
comprehensive study of NGC 300 by Breso lin et al.1 (|2009l ) 
who published line fluxes for Hii regions across the galaxy 
as well as abundances derived using the direct method. We 
removed the de-reddening applied to the published fluxes 
and ran our models. The resulting comparison of oxygen 
abundance estimates is shown in Fig. [71 

The top panel compares the mean electron temperature 
in our models to that estimated from the direct method by 
Bresolin et al against the radius in units of R25 . The differ- 
ence between the Te predictions from our models and the 
published Tfom] estimates, including the published uncer- 
tainty on the T[oiii] values, are shown as box-plots where 
the horizontal line shows the median, the boxes enclose 50 
per cent of the likelihood, and the vertical bars enclose 95 
per cent, or d=2cr for Gaussian errors. The agreement is quite 
satisfactory, in particular given that our fits, in contrast to 



the direct method, do not include any lines that are strongly 
sensitive to Te, such as [O ill] 4363. 

In the bottom panel we show the resulting oxygen abun- 
dance gradient. The solid circles show the data from Bresolin 
et al, while the box plots show the results of our fits, with 
the dashed line indicating a local regression, or loess, smooth 
of the medians from our fits. The immediate point to no- 
tice is that the abundances are in very good agreement at 
ne arly all radii — this is in contrast to the comparisons 
in IBresolin et al.l ((2009|) which used fitting formulae for the 
strong line abundance estimates. 

In general we see similar agreement at lower metallici- 
ties although there has been some indication of a poorer fit 
in syste ms with si gnificantly enhanced nitrogen relative to 
oxygen ([Yin et al.l 12007). This is likely caused by the fixed 
abundance ratios adopted at a given metallicity in the cur- 
rent CLOl models. These systems typically have a very high 
SFR/M* and make up only a very small fraction of the sam- 
ple used in the present work. 

Most of our galaxies are more metal rich than the range 
probed in Fig. [71 so it is natural to ask whether the agree- 
ment continues or not. This is uncertain but here we will 
take the approach that we will a) test our gas mass esti- 
mates and look for offsets there and b) make the reasonable 
assumption that although there might be uncertainties in 
the absolute abundances, the relative abundances should be 
robustly determined. In passing we note that this uncer- 
tainty also leads to uncertainties in the determination of H2 
masses due to the metal dependence of the Xco conversion 
factor. Indeed with current parametrisations of this depen- 
dency, e.g. equations ([23|) and ([24|) below, the uncertainty 
on molecular masses due to metallicity uncertainties is as 
large, or larger than those intrinsic to our method. 



4 DATA USED 

In the next section we will discuss the tests we have made to 
verify the present method for gas column density and dust- 
to-gas ratio estimation. To carry out these tests we need 
both optical spectroscopy to run our method on, as well as 
resolved and integrated gas mass measurements from CO 
and H i observations to compare our results with. 

We use optic al spectroscopy from the Sloan Digital 
Sky Surve y (SDSSlYork et al.ll2000l ) Data Release 70 (DR7 
lAbazaiian et al.l l2009l V The SDS S obtain ed 5-band imag- 
ing using a drift-scanning camera (ICunn et al.l 199g ) on the 
2.5m telescope at Apache Point (|Gunn et al.ll2006l V Spec- 
troscopic targets were selected on th e basis of the i magin g 
and a tiling algorithm described in Blanton et al. (2003). 
The spectra were obtained using a fibre spectrograph with 
640 fibres per plate, with 3" fibre apertures. The p hoto- 
metric system is discussed by iFukugita et al.l (|l996l l with 
the photometric ca libration being covered in more detail in 
ISmith et al.l ((2002). Where relevant we use the photometry 
from the Catalogue Archive Serveo We classify galaxies 
into six classes following B04 based on their emission line 
properties. In particular we classify galaxies with S/N> 3 in 



^ http://www.sdss.org/dr7 
^ http://cas.sdss.org/dr7/en 
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H/3, [O III] 5007, Ha and [Nil] 6584 into star- forming, com- 
posite and AGN based in the leftmost diagnostic diagram 
in Figure [21 In the following we will focus on the star- forming 
(SF) class. 

We will also make use of the massive Integral Field 
Spe ctroscopy (IFS) mosaic of NGC 628 published by 
[Sdnchez et aP (|201lh as part of the PINGS projeclEl. This 
was obtained with the PPAK spectrograph on the 3.5m tele- 
scope at Calar Alto. We use the reduced data provided on 
the PINGS project page and selected a total of 87 regions 
across the galaxy (see Fig.[8|). For each region we sum up line 
fluxes within circular apertures of diameters of 6" to 24" , we 
used the range of aperture sizes to test the sensitivity of our 
results to aperture size, but as the results are only weakly 
sensitive to aperture size we will focus on results for 12" 
apertures in this paper unless otherwise stated. To ensure 
we have a consistent alignment of the images, we tie the IFU 
mosaic world coordinate system to that of the Ha image of 
NGC 628 provided by the SINGS survey (|Kennicutt et al.l 
l2003l ). 

We will also make use of reso lved Hi maps from the 
THINGS survey (IWalter et al.|[2Q08.) as well as CO maps 
from the BIMA S ONCll (iHelfer et all l2003l). Nobeyamg P 
(JKuno et al.l l200 7l) and HERACLES (|Lerovet aDl2009bl r 



FALFA and S05. In total this gives 11,253 unique matches, 
416 from CASS, 9,762 from ALFALFA and 1,259 from S05. 
We calculate H i masses in solar masses from the fluxes 
using 



Mhi[M0] = 2.36 X lOf'D'^S 



(21) 



where S is the flux in Jy, or Jy/beam for resolved data, 
and D is the distance in Mpc. For the S05 dataset, which is 
abundant in edge-on galaxies, we adopt the flux corrected 
for self-absorption, while for the other datasets we use the 
observed flux. Note that we do not correct for inclination 
effects because these would affect the Sgas derived from our 
method in the same way as it would the H i and CO maps. 
Thus throughout we will quote observed surface mass den- 
sities, not deprojected. 

For the molecular gas data, we follow iLerov et al.l (|2008h 
and calculate molecular surface mass density maps from CO 
J = 1 ^ maps using 



Sh2 [Mopc-'j = 4.4 



2 X 1020 



/co, 



(22) 



surveys — we use HERACLES data whenever possible but 
use the other data sets for reference. 

We match these datasets to the SDSS to flnd regions 
where we have resolved Hi and (optionally) CO maps as 
well as SDSS spectra which we can use to obtain total gas 
column densities using our method. We searched the SDSS 
for all RC3 galaxies with multiple SDSS spectra across the 
face of the galaxy and then matched this to the THINGS and 
HERACLES surveys. This resulted in the identiflcation of 
100 SDSS spectra in 12 galaxies, see Table [3] for an overview 
of these. All these galaxies have H i maps from the THINGS 
survey and CO maps from the HERACLES survey, cross- 
checked against the BIMA SONG and Nobeyama surveys 
when possible. For these maps we also measure gas surface 
mass densities in apertures with diameters of 6", 12" and 
24"', but the spectroscopic aperture is of course flxed to 3" 
by the SDSS flbre diameter. 

We will also make use of integrated Hi data and for 
this we make use of the ALFALFA survey a-40 release 
(iHavnes et al.ll2011), the second da ta release (DR2) for the 
GASS surv ev ([Catinella et al. 1201 2| ) and the compilation of 
Hi data bv lSpringob et al.1 (l2005l . hereafter S05). For GASS 
DR2 we use the catalogues provided by Catinella et al. 
(|2012l ). and for ALFALFA we u se the match to SDSS DR7 
provided bv lHavnes et al.1 (|201lh . We match S05 to the SDSS 
DR7 using a 5 arcseconds match radius, which is small but 
our aim is not to be complete here. We furthermore require 
that the velocity offset between the SDSS and H i redshifts is 
less than 250 km s~^. When there are Hi observations from 
more than one survey, our order of priority is GASS, AL- 



^ http : //www . ast . cam . ac . uk/ioa/research/pings/html/index . html 

^ Downloaded from 

http : //nedwww . ipac . caltech . edu/level5/March02/S0NG/S0NG . html 

^ Downloaded from 

http : //www . nro . nao . ac . jp/nro45mrt/C0at las/ 

^ Downloaded from 

http : //www . cv . nrao . edu/aleroy/HERACLES/Overview . html 



with /co being the CO intensity in K km s ^ . We convert 
the HERACLES J = 2 ^ 1 maps to J = 1 ^ maps 
using the c onversion /g o (2 -^ 1) = 0.8/co(l -^ 0) recom- 
mended bv iLerov etHI (|2008l ). Xco is the CO-to-H2 col- 
umn density conv ersion factor in un its of cm~^ K~^ km~^ s 
(see for instance iLerov et al.l bOOSi ). For Milky Way con- 
ditions it appears that a constant value of Xqq — 
2 X 1 0^° cm~^ K~^ km~^ s is appropriate fe.g. iDame et al.l 
|200l|), but as the galaxies considered here span a range in 
physical properties it is likely incorrect to adopt a flxed 
Xco fe.g. lArimoto et al.iri996l : llsraellll997bl ). Hence we also 
make use of a variable Xqq ■ For th i s we u se the metallicity 
dependent Xco from iBoselli et al.l (|2002l ) as our reference 
parametrisation. 



logXco = -1.01 log 0/H + 29.28, 



(23) 



but we have also explored the stronger trend advocated by 
llsraell (|l997bl ) viz. 



logXco = -2.6 log 0/H + 11.6. 



(24) 



Calibrations of Xco against metallicity in the litera- 
ture show a large variation between d i fferent authors (see 
Bolatto et aP l2008l : iLerov et al.l l201ll : iKennicutt fc EvansI 
2OI2I , for summaries) . They range from flnding no trend with 
metallicity, ie. a constant Xco via a logarithmic slope of 
— 1 to —2.6. As iBolatto et al.l (|2008|) points out, there are 
systematic variations between the techniques used to infer 
Xco but the implication is that one must view Xco as be- 
ing signiflcantly uncertain at low metalli city and a si milar 
behaviour is also found at z > 1 (Genzel et al.ll2012i ). We 
remark that although the majority of the effort lately has 
focused on the metallicity dependence of Xco , it is possible 
that other factors, such as the i ntensity of th e radiation fleld 
also could have an effect (e.g. llsraell 1 1997bh but we ignore 
this complication here. 



5 TESTS OF THE METHODOLOGY 

We will test the method outlined in section [2] in two steps 
that logically follow from viewing equation (|13p as having 
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Figure 8. Three images of NGC 628, on the left the SINGS Ho; image, in the middle the Hi image from the THINGS survey, and on the 
right the CO image from the HERACLES survey. The locations of the 87 regions in NGC 628 where we calculate gas column densities 
from the IFU, Hi and CO maps are overplotted as red circles. Note that the regions sample a range of Hl/CO ratios. 

Table 3. An overview of regions with Hi and CO interferometric data, while at the same time have spectroscopy from SDSS or from 
the NGC 628 IFU mosaic consistent with being ionised by stars. The complete table is provided electronically. The first column gives 
the name of the galaxy with a running index for the region appended. The second column gives the SDSS spectroscopic identification, 
followed by the position of the fibre the spectrum was obtained through. The final column gives the zero-offset index of this object in 
the MPA-JHU value-added catalogues(Brinchmann et al. 2004) found at http//www. mpa-garching.mpg.de/SDSS. 



Galaxy 


PlatelD-MJD-FiberlD 


Ra (J2000) Dec 


IDR7 


NGC2403-00 


2944-54523-237 


+07:36:51.23 


+65:41:01.65 


920120 


NGC2903-00 


2289-53708-615 


+09:32:10.09 


+21:30:08.26 


794801 


NGC2903-01 


2292-53713-310 


+09:32:09.68 


+21:31:06.60 


795658 


NGC2903-02 


2289-53708-616 


+09:32:00.35 


+21:27:40.40 


794802 


NGC2976-00 


1879-54478-291 


+09:47:15.31 


+67:55:00.05 


664806 


NGC2976-02 


1878-54474-601 


+09:47:07.99 


+67:55:53.14 


664573 


NGC2976-03 


1879-54478-281 


+09:47:07.99 


+67:55:53.14 


664798 



two components: The dust column density and the dust-to- 
gas ratio, obtained through a combination of the metallicity 
and the dust-to-metal ratio, of the gas. We commence our 
testing by checking our dust-to-metal and dust-to-gas ratio 
estimates. We do not test our metallicity estimates explic- 
itly here, see section [331 for a brief discussion of this. To test 
the gas column densities we take three different approaches 
that have different inherent systematic uncertainties and by 
combining these we should be able to provide a robust test of 
our method. The first technique compares Sgas determined 
from SDSS spectra and the NGC 628 IFU mosaic with that 
of H i+CO gas maps. The beams of the resolved maps are 
only larger than the SDSS fibres by a factor of 2-3 so we 
anticipate only moderate aperture effects in this compari- 
son. We will find below that our technique gives gas mass 
estimates that are in very good agreement with the H i+CO 
maps. 

However the derivation in section [2] is only strictly valid 
for angle- averaged quantities, not resolved properties as used 
in this comparison. Furthermore, the sample for which we 
have good quality Hi and CO maps is relatively modest in 
size, and the galaxies are fairly close to face-on. This leads 
us to the second test which looks at how our gas estimates 
change as the aperture sampled changes — we will show 
that this way we can recover trends of H i content in galaxies 



found in recent Hi surveys when the SDSS fibre samples a 
significant fraction of the stellar disk. 

The natural final step is to compare our gas content 
measurements from SDSS spectroscopy to measurments of 
total Hi masses which are available for large samples of 
galaxies today. This requires careful adjustments for the dif- 
ference in apertures and it turns out that although this com- 
parison works well, systematic uncertainties in the aperture 
corrections weaken any conclusion reached with this test. 
This comparison is therefore discussed in Appendix [Al 

Note that in the following we will also define 






M,, 



M* 



(25) 



as the gas ratio. Note that the second equality which we 
generally will assume holds, assumes that the scale-lengths 
of the gas and the stars are at least comparable. This is 
closely related to the gas fraction. 



/gas — 



. + S* 1 + 1/rg 



(26) 
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Table 4. Measurements for the regions indicated in Table O the complete table is provided electronically. 



Galaxy 



log Eh I 



log Emoi 
16% 



log Sspec 

median 



log Sspec 

84% 



logSs 



EW(Ha) EW(H/3) Ha/H/3 g- 



NGC2403-00 


1.07 


1.07 


0.59 


1.25 


1.92 


17.9 


3.1 


3.19 


0.16 


0.99 


NGC2903-00 


0.58 


2.00 


1.09 


1.21 


1.31 


73.0 


13.2 


5.36 


0.65 


1.58 


NGC2903-01 


1.21 


1.53 


1.07 


1.11 


1.18 


164.7 


21.0 


5.81 


0.49 


0.94 


NGC2903-02 


0.93 


0.95 


0.96 


1.29 


1.54 


44.6 


10.8 


3.51 


0.21 


0.64 


NGC2976-00 


0.87 


1.45 


0.93 


1.04 


1.15 


58.1 


15.8 


3.81 


0.91 


2.21 


NGC2976-02 


1.33 


1.87 


1.61 


1.78 


1.91 


207.9 


32.3 


4.24 


0.47 


0.94 


NGC2976-03 


1.33 


1.87 


1.67 


1.80 


1.91 


178.6 


27.3 


4.22 


0.47 


0.94 



0.5 



0.4 



i^ 0.3 



0.2 



0.1 




8.60 < OH < 8.70 
8.70 < OH < 8.80 ■ 

8.80 < OH < 8.90 
8.90 < OH < 9.00 
9.00 < OH 
9.10 < OH < 9.20 
9.20 < OH < 9.30 



5 

Ha/H|3 



Figure 9. The dust-to-metal ratio of the gas as a function of the 
Balmer decrement in seven bins in oxygen abundance; the inferred 
Ay using a ISeatonI (|l9 79l extinction curve is shown on the top 
axis for reference. The shading indicate the uncertainty on the 
mean — the spread around the mean is considerably larger. It is 
clear that the trend with dust attenuation is similar at different 
metallicities although there is a trend for an increasing depletion 
at higher metallicity. 
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Figure 10. The dust-to-metal ratio of the gas as a function of 
the oxygen abundance. The grey scale shows the conditional likeli- 
hood of log ^ in bins of O/H. The solid blue line shows the median 
trend and the yellow dashed-dotted line shows the linear fit at 
high metallicity given in equation 1271 Note the lack of constraints 
on ^ below an oxygen abundance of 8.6. 



5.1 The dust-to- metal ratio as a function of 
metallicity 

We discussed the properties of ^ in section [3] but we did not 
discuss what values our fitting procedure provides. In the 
Milky Way we know there is a cor relation betwe en depletion 
strength and (local) density fe.g. I Jenkins! l2009l ) and Fig. [9] 



shows that we find a similar result for our fits to star-forming 
galaxies in SDSS DR7. This figure plots our ^ values against 
Ha/H/3 as a proxy for line-of-sight attenuatio n. We indicate 
the V-band attenuation, Ay^ obtained using a lSeatonI (|l979r ) 
extinction curve on the top axis for reference. The figure 
was constructed by co-adding the individual PDFs of ^ and 
Ha/H/3. The PDF of the latter was calculated on a grid 
with bins of 0.043 while the PDF for ^ was interpolated 
onto a grid with bin size 0.004. We calculate the mean and 
uncertainty on the mean on the resulting combined two- 
dimensional PDF and only show the trends where at least 
10 galaxies contribute to the bin in Ha/H/3. In this and the 
following figure we use all SF class galaxies in the SDSS 
since we are comparing two quantities derived within the 
same region, the figure does not change significantly except 
in the metallicity ranged spanned when limiting attention 
to a specific redshift range. 

The solid line shows the mean and the shading the un- 
certainty on the mean for each trend. The results are shown 
for seven bins in oxygen abundance, from O.SZ© to ^ SZ©. 
We do not include lower abundance bins because, as we dis- 
cuss next, we are unable to constrain ^ accurately at low 
metallicity. 

The first point to notice is that we do see stronger deple- 
tion in more dusty systems, similar to the trend seen along 
different sight-lines in the Milky Way although the signifi- 
cant difference in methodology bars us from making a much 
stronger statement, and the trend is fairly weak. The second 
point to note is that although the depletion appears to vary 
systematically with oxygen abundance, the dependence on 
attenuation appears to be the same for most metallicities. 

This indicates a significant dependence of depletion on 
metal abundance and we make this explicit in Fig.llOlwhich 
shows the trend of ^ with oxygen abundance for star- forming 
galaxies from the SDSS DR7. This uses the full likelihood 
distributions but is shown conditional on the oxygen abun- 
dance, with the solid line indicating the median to guide the 
eye. This shows two noticeable features. Firstly we can see a 
clear correlation between ^ and 12 -h log O/H at high metal- 
licity — in the sense that we predict a stronger depletion of 
elements in galaxies with higher metallicity. For reference, 
the behaviour at high metallicity is well described by 



log^ = -4.45 + 0.43OH, 



(27) 



where OH = 12 + log O/H. This is a good fit to the data for 
12 + log O/H > 8.7 and is overplotted as a dashed-dotted 
yellow line in the figure. But note that this includes the trend 
with attenuation as well thus the intrinsic trend at fixed 
attenuation is slightly shallower but we ignore this here. 
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using our emission line modelling. In this equation fooR is 
the ratio of dust mass density to that of the total gas mass. 
For comparison to other work, which focuses on the mass 
ratio of dust relative to hydrogen, we adjust our estimates 
here by a factor of 1.38 to take out the contribution of he- 
lium to the gas mass densit y, a factor which uses the solar 
abundance ratios from Asplund et al.l (|2009h . 

To test our estimates, we focus our attention on the 
fooR versus oxygen abundance relation. We use the full 
likelihoods of these parameters and show the resulting dis- 
tribution in Fig. [11] The solid/dashed black line shows the 
median fooR at the given oxygen abundance, while the 
dashed/dotted lines show the 68 per cent spread around 
the median. In section 15.11 above, we showed that at 12 + 
log O/H < 8.6, our method is not able to place strong con- 
straints on ^ and as a consequence our estimate of fooR 
is much less certain. This region is therefore indicated with 
a different linestyle in the figure. To indicate the effect of 
modifying ^, we overplot the result of modifying our focR 
estimates using equation (|27)) as a thin blue line. At low 
metahicity the effect can be almost 0.5 dex. 

W e compare our estimates to those of iDraine et al.l 
(|2007l ) . who modelled the dust emission o f SINGS galax- 
ies using a range of MIR data, and those of Galam etz et al.l 
^2011) who used a similar approach but a different mod- 
ehing technique. These are overplotted in Fig. [TT]as red cir- 
cles (open for galaxies lacking a CO detection) and filled 
squares respectively. For the Draine et al data we have cal- 
culated oxygen abundances using our models using the data 
described in Moustakas et al. (2010 ). kindly provided by J. 
Moustakas. For the Galametz et al data we use the metallici- 
ties provided in their table 1. For consistency with Galametz 
et al, we have adopted the Xco dependency in equation (|23)) 
for all data shown here. For the Draine et al sample we use 
the gas masses measured within the reg ion where dus t emis - 
sion is detected — see the discussion in iDraine et al.l (|2007h 
for why this is the reasonable choice. 

It is also interesting to compare our results with the 
simple ass umption that (5dgr scales linearly with metallicity 
fc.f. .Draine et al.ll2007h . To calcula te this we have made use 
of the depletion patterns derived bv ljenkinsi (|2009|), adjusted 
to the CLOl abundances. This leads to a prediction for fooR 
of 



Figure 11. The solid/dashed line shows the median dust-to-gas 
ratio for the SDSS DR7 as a function of gas-phase oxygen abun- 
dance, with the dashed/dotted lines indicating the 68 per cent 
spread around this. The change in dashing at 12 + log O/H = 8.6 
indicates the region where constraints on ^ are poor. To illus- 
trate the effect this has, the blue solid line shows the effect of 
assuming that ^ follows equation (|27|) . The open and filled cir- 
cles show the (5dgr estimates from Draine et al (2007) for SINGS 
galaxies — filled circles for galaxies with CO detections, empty 
circles for galaxies without this . The solid blue square s are from 
the study of nearby galaxies by ICalametz et al.l (|201lh . The grey 
shaded region shows the expected locus if the dust-to-gas ratio is 
sc aled linearly wit h metallicity from the Milky Way value inferred 
bv lJenkinsl(|20Q9l V 



The second point to note in this figure, is that we can- 
not place any constraints on ^ at metallicities lower than 
12 + log O/H ^ 8.6, or ^ 50 per cent solar, recall that the 
solar oxygen abundance for the CLOl models is 8.81. This 
appears to be a consequence of the higher electron temper- 
ature in low metallicity gas and subsequent lower sensitiv- 
ity to ^ as indicated by Fig. [1] Thus we are unable to say 
whether the dust-to-metal ratio continues to decrease, as an 
extrapolation using equation (|27)) would predict, or whether 
the relationship at low metallicity is flatter, as for instance 
the chemical evolution models bv ICalura et al.l (2008) sug- 
gest and which the fit results hint at. 

In the following section we will therefore refer to two dif- 
ferent estimates of ^. Our first estimate will be that derived 
from the fitting method and hence which shows a flattening 
at low metallicity, the second will use the fit in equation (|27)) 
to adjust ^ after the fact — we can do this because the model 
fits at low metallicity are approximately independent of ^. 
In the rest of the paper we will for simplicity focus on the 
values provided by the fitting method with and without a 
prior on ^. 



5.2 Dust-to-gas ratios as a function of metallicity 

The dust-to-gas ratio in galaxies is a crucial parameter for 
a wide range of models, and has been extensively modelled 
with focus on dust e missivity (see e.g. the compilation in 
ICalametz et al.ll201lh . As remarked above a key ingredient 
in our approach to estimate gas content is that we can place 
constraints on the dust-to-metal, and by extension the dust- 
to-gas, ratio 



^DGR 



(5.3 X 10"^-1.1 X 10"^) 



f Q/H 

I (O/H), 



(29) 



fooR — ^Z, 



(28) 



Where the range is given for F* = to F* = 1. The re- 
sulting region is shaded in grey in Fig. 1111 It is clear that 
the linear scaling appears to be a decent overall description 
but our data differ from this somewhat in detail, having a 
steeper slope down to 1/2^©, and if we extrapolate with 
equation (|27|) we differ rather more at low metallicity. 

The main conclusion from this test is that there is a 
good general agreement between the measurements in the 
literature and our values despite the very different tech- 
niques used. A notable difference between our method and 
the other efforts in the literature is that our measure of 
focR is a local measure in contrast to methods that use 
data obtained over different areas and with different spatial 
resolution. 

It is also worth asking whether a scaled dust-to-gas ratio 
would be sufficient, since that appears to capture much of 
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trend in Fig. [TT] This does of course depend on the question 
asked, but it does rely on ^ being approximately constant 
and Fig. [9] and [10] show that this is a questionable assump- 
tion in detail. Ignoring the variation in ^ would lead to sys- 
tematic errors in the dust-to-gas ratio that could amount to 
as much as a factor of two across the bulk of the galaxies in 
the SDSS and this would be a significant source of error in 
many studies. 

5.3 Comparisons to resolved gas masses 

We can carry out a direct test of our gas measurements when 
we have spectroscopic observations of galaxies where we also 
have interferometric maps in Hi and CO. We will here use 
two sources of such spectra: fibre spectra from the SDSS and 
spectr a from the IFU mosaic of NGC 628 by I Sanchez et al.l 
(|201lh . We show these using different colours below to ver- 
ify that the method works equally well for both classes of 
spectra. 

As an illustration of our resolved maps, Fig. [12] shows 
the r-band. Hi, and CO images of MlOl (NGC 5457) with 
the location of spectra from SDSS indicated as red squares 
and NGC 628 was shown earlier in Fig. [8] The r-band im- 
age is a mosaic of SDSS r-band images, the H i map is from 
the THINGS survey and the CO map from the HERACLES 
survey. We have explored apertures from 6" to 24" when 
measuring Shi and Sco, corresponding to 0.2-0.8 kpc at the 
adop ted distance for MlOl of 7.4 Mpc ( Karachentse v et al.l 
|200J), and comparable sizes for the rest of the sample. Here 
we will only show results for apertures of 12" since this 
is close to the angu lar resolution of the HERACLES data 
(JLerov et al. ' 2009b). Note that this aperture is four times 
that of the SDSS data, for which all Sgas measurements are 
obtained in an aperture of 3". For the NGC 628 data we 
can synthesise spectra in arbitrary aperture sizes but since 
we work with the emission line maps rather than the data 
cube, this is not an exact sum of the emission flux within 
the aperture. 

For each spectrum we run our model fits as detailed 
above and inspect each resulting fit in detail. The SDSS 
spectra do not sample [Oil] 3727 and generally have fairly 
low S/N which occasionally leads to poorly constrained fits, 
while for the NGC 628 mosaic, the [S ii] line fluxes are occa- 
sionally significantly off and we therefore do not include [S ii] 
in the fits shown here. We exclude fits that give multi-modal 
PDFs and very low S/N spectra — quantitatively we make 
a cut at a Kullback-Leibner divergence of 15 for the Sgas 
PDF. In practice this corresponds to spectra with such weak 
emission lines that no constraint can be had for Sgas- This 
results in a total of 104 (59 from SDSS, 45 from NGC 628) 
high-quality measurements. The results do not depend much 
on these cuts because these poor fits merely add scatter to 
the relation and since the uncertainty is large their effect 
on any fit to the data is minor and we have verified that 
including them does not significantly affect the conclusions 
below. 

We compare the Sgas from our models with the values 
extracted from the H i+CO maps in Fig. 1131 The left panel 
plots Hhi for these regions against the spectroscopic Sgas 
estimates. The point sizes in this figure are inversely pro- 
portional to the uncertainty estimate on logSgas between 
0.1 and 0.6 dex. The blue line shows an unweighted bisector 



fit to the data which has a slope of 0.79 and an intercept 
of 0.43. The dashed red line shows the 1-1 relation with 
a factor of 2 offset indicated by the dotted lines. While the 
data clearly correlate with each other, the slope is noticeably 
different from 1. 

The right panel shows the same but now using the total 
gas mass on the abscissa. We calculated the molecular mass 
using the metallicity dependent Xco from equation (|23)) 
with the metallicity from our model fit. We have also ap- 
plied the broad prior on ^ although the effect of both of these 
corrections are moderate as the majority of the spectra are 
close to solar metallicit y. For NGC 4449 (r ed filled circle) we 
use the observations of iHunter et al.l (|200Q), whose region B 
overlaps with several of our spectra and when adjusted to 
our default Xco has Mn^/Mni = 0.225. The agreement is 
better in this panel with the bisector fit having a slope of 
0.93 with an intercept of 0.03. 

The overall agreement must be said to be good as long 
as the total gas mass is used, showing explicitly that the 
method traces the total gas surface mass density. The slope 
and intercept does depend somewhat on the aperture used 
for the gas measurements, whether a metallicity dependent 
Xco is used and whether or not a ^ prior is applied. For aper- 
tures of either 12" or 24", the slope of the best unweighted 
bisector fit varies from 1.06 to 0.93 and the intercept from 
-0.15 to 0.06, thus fully consistent with a 1-1 relation. 

The scatter around the best linear fit varies from 0.30 
to 0.36 dex, which is mostly explained by the uncertainties 
on the variables. It is difficult to accurately quantify the in- 
trinsic scatter in this relation as it depends sensitively on 
the uncertainty estimates on the CO measurements which 
are hard to quantify due to the aperture mismatch between 
the SDSS and the CO data, as well as uncertainties in the 
CO-H2 conversion factor, but fitting a linear relation with 
intrinsic scatter following .Kellv. (^2007) the data are consis- 
tent with a linear relation with intrinsic scatter of 0.1-0.15 
dex. 

Thus we can conclude that at least over the range of gas 
surface densities from 10-100 M© pc~^, our spectroscopic 
method gives results in agreement with those inferred from 
H 1+H2 maps to within a factor of two. Although shown only 
for an aperture of 12", this conclusion is robust to the aper- 
ture size chosen. Given the lack of suitable comparison data, 
we are unable to extend this to lower densities but when ap- 
plied to SDSS data the method appears well behaved down 
to at least a surface gas mass density of ^ 2M0 pc~^ where 
our sample starts to peter out. 

The figure also shows that our estimator provides a 
measure of the total gas mass. As an example to highlight 
this, the nuclear spectrum in MlOl (1323-52797-012) cor- 
responds to a location with essentially no Hi gas and the 
resulting mass density is plotted as an open circle in the 
right panel of Fig. 1131 and connected to the total (H1-I-H2) 
mass density by a green line with an arrow. 

There are however some cases where the method fails. 
The three main outliers are indicated with open squares in 
the figure. From left to right these are: the nuclear bar in 
NGC 2903 (2289-53708-615), which is well-kn own to have 
a part icularly high molecular gas density (e.g. iLerov et al.l 
2009b j, an off-centre region in NGC 3627, and a nuclear 
spectrum in NGC 3351. The off-centre region in NGC 3627 
has strongly discrepant H2 estimates between BIMA and 
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Figure 12. Left: SDSS r-band image of MlOl with the location of SDSS spectra overlaid as red boxes. Middle: Hi map of MlOl from 
the THINGS survey. Right: The CO gas mass map of MlOl from the HERACLES survey. 
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Figure 13. Left: Surface-mass density of Hi measured in a 12" aperture versus our spectroscopic gas estimates for 59 locations in 11 
nearby galaxies with SDSS spectroscopy (blue) and 45 locations in NGC 628 (green). The red solid symbol is for NGC 4449. The points 
have been scaled according to the uncertainty estimate so that the largest circles correspond to objects with la uncertainty on log Sgas 
less than 0.1 dex. The smallest symbols correspond to uncertainties larger than 0.6 dex. The dashed red line shows the 1-1 relation, with 
the dotted lines either side showing a factor of 2 spread. The solid blue line shows the best-fit bisector fit to the data. Right: Like the 
right-panel but now plotting the total gas content on the x-axis. We show this for a metallicity dependent Xqo and using the broad ^ 
prior. The green line with an arrow shows the contribution of H2 to the central aperture in MlOl with the location for Hi only shown as 
an open circle. From left to right the open squares indicate the position of NGC2903-00, NGC3627-01, and, NGC3351-00 see the text 
for a discussion. 



HERACLES and if BIMA were used instead of HERACLES 
it would fall close to the 1-1 line, but the nuclear spec- 
tra have consistently high mass estimates from BIMA and 
HERACLES and the disagreement with the spectroscopic 
estimates appears to be real. Quantitatively we start to see 
a systematic disagreement between our spectroscopic gas es- 
timator and the CO+Hi maps at surface mass densities of 
gas above ^ TSM© pc~^. 



It is important to test whether the agreement between 
the two gas mass measurements depend on other properties 
of the regions sampled in a systematic manner. We there- 
fore calculate the difference between logSgas inferred from 
Hl+CO data, the "direct" measure, and the spectroscopic 
estimate. In Fig. [T3]we plot this difference as a function of 
four observables. On the left we show it as a function of 



Shi and we see no clear evidence of a systematic mismatch 
between the two as a function of H i surface mass density. 

The second panel shows the difference as a function 
of equivalent width of Ha. No significant trend is seen, al- 
though we note that the systems with high total gas mass 
density tend to have low EWs. This is due to a combination 
of these being nuclear spectra, where the background stellar 
light is significant, and to the star formation here typically 
being obscured. There are no NGC 628 symbols here as the 
data release does not provide equivalent widths. 

In the third panel we plot the residuals against local 
colour (fibre magnitudes for SDSS and aperture photometry 
on the SDSS images of NGC 628 otherwise). Here we see a 
slight systematic offset at the red end, which is due to the 
nuclear spectra, where the very high H2 densities are found. 
This could conceivably be used to identify possible cases 
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Figure 14. The difference between the spectroscopic and direct gas mass estimates in 12" apertures plotted against the H I surface mass 
density (left panel), equivalent width of Ho; (second panel), the local g — r colour (third panel) and Hq;/H/3 flux ratio (rightmost panel). 
The SDSS data are shown in blue and the NGC 628 data in green, the other symbols are as in Fig. [13] The red horizontal line shows 
the zero level and the dashed lines around this shows a factor of 2 of scatter. We do not have equivalent width measurements for Ho; for 
NGC 628. 
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where our method fails, but we do not attempt to do that 
here. 

The final panel shows the residuals as a function of the 
Balmer decrement. Here we see no trend showing that our 
method works well for a wide range of optical depths. We 
have further verified that no systematic trends are visible as 
a function of oxygen abundance or the signal-to-noise of the 
spectroscopic gas estimate. 

We conclude from this that the method appears to work 
well for surface densities of gas < 50 M© pc~^ but appears 
to break down at surface densities > 100 M© pc~^. This 
limitation is expected, because at high optical depth the 
emission lines we see will typically come from a skin outside 
the high optical depth region. 

While this is a real limitation, we should emphasise that 
it is a problem only for nuclear spectra. The scale sampled 
by a 3" fibre varies from 42 pc (NGC 4214) to 200 pc (NGC 
3198), while the median fibre in the SDSS {z < 0.2) subtends 
^ 5kpc. On larger scales we will not be dominated by these 
extreme conditions in the majority of galaxies. 

5.4 Moving from smaller to larger scales 

The preceding section showed that our method provides gas 
mass measurements in good agreement with resolved gas 
measures for all but the highest gas densities. Formally, 
however, our derivation in section [2] is only valid for angle- 
averaged quantities, thus we would like to carry out our 
checks for larger samples and since most of the SDSS fibre 
spectra sample much larger scales than the ^ 100 pc our 
tests above explored, we would also like to extend our tests 
to larger spatial scales. 

In lieu of large samples of more distant galaxies with 
Hi maps, we have two options. We start by synthesising 
apertures of increasing size using the IFU mosaic of NGC 
628. This allows us to test how well we can recover the Hi 
content within the optical radius of the galaxy. The result of 
this exercise is shown in Fig. 1151 The box plots summarise 
the PDFs from our fits, while the blue line shows the total 
gas mass estimated from the Hi and H2 maps using the 
metallicity dependent Xco in equation (|23|) . The red line 
shows the result for a constant Xco- 

The overall agreement is very good as long as we take 
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Figure 15. The total gas mass within apertures with different 
radii for NGC 628. The results are plotted as box-plots with the 
box enclosing 50 per cent around the median of the PDF, and 
the error bars enclosing 95 per cent of the likelihood. The solid 
red line shows the total mass estimate from the H I and CO maps 
assuming our reference Xqq. The blue line with dashed error lines 
shows the result of assuming a metallicity dependent Xco? using 
the full PDF of 12 + log O/H from our fits. The flux uncertainties 
for each radius were scaled to have the same S/N as the second 
radial bin before running the fits. The top axis shows the redshift 
at which a 3" SDSS fibre would enclose a physical aperture with 
radius given on the lower axis. Note that as this is a cumulative 
plot, the error bars are not independent. 



into account the metallicity dependence of Xco, with the 
main uncertainty coming on the largest scales. The agree- 
ment is typically better than a factor of 2 across a wide range 
of sampled sizes. It is also worth noting that the agreement 
appears to be equally good regardless of whether the gas 
is dominated by H2 (within a radius of 3 kpc), or atomic 
gas. There does seem to be a tendency for systematic offsets 
at very large radii. The reason for this appears to be that 
only regions of relatively high star formation activity have 
sufficient S/N to have their line fluxes measured in the IFU 
data. These regions are typically more gas rich than the av- 
erage at those radii and hence the spectroscopic estimates 
are biased high, but this should not be taken to be a failure 
of the method, rather it is a sign that we are not comparing 
like with like here. 
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We can extend this kind of test to the full SDSS sample 
by exploiting the fact that the SDSS fibre aperture spans 
different physical sizes at different redshifts, thus as redshift 
increases we should trace a larger and larger fraction of the 
gas content of the galaxy. This will not provide total gas 
masses for the galaxies, but at z ^ 0.2 the SDSS fibre will 
subtend ^ 10 kpc and encompass ^ 50 per cent of the light 
of the galaxy in our sample, so one might expect that the 
gas content is more representative for the total. 

To test this we wish to compare our rgas-M* re- 
lation against the Mui/ M^-M^ relation. This has been 
well characterised locallv (JBothunl Il984l : iKannappanl l2004l : 
ICatinella et al.ll20ldJ2012l V and within large enough aper- 
tures, the dominant gas phase might expected to be atomic 
(but see Saintonge et al. 2011b ). As an example, ignoring H2 
for NGC 628 underestimates the total gas mass by < 0.2dex 
as long as we sum in apertures with radius > 4.5kpc. Thus 
the Mhi/M*-M* relation should be a good proxy for the 
Tgas-M* relation as long as the H i determinations are from 
a region larger than this. 

The result of this comparison is shown in Fig. 1161 The 
solid black line here shows the rgas — M* trend found for star- 
forming galaxies in the SDSS using the fibre spectra. Each 
panel corresponds to a range in physical size subtended by 
the SDSS 3" fibre, as indicated in each panel. The blue and 
red points are from the ALFALFA and GASS Hi surveys 
respectively, note that for GASS we only use the Hi de- 
tections as the gas mass estimates using our method only 
includes objects with measured gas content. Since only Hi 
detections are included these are likely to be biased high, 
so we also include the median rgas predicted using the pho- 
tometric gas content technique introduced by IZhang et al.l 
(|200 9. Z09), discussed further below, plotted as the solid 
purple line. This is calculated for all galaxies falling in the 
indicated radius/redshift bin. 

When sampling small scales, the gas content is low rela- 
tive to the stellar content, as one would expect i n most opti- 
cally bright galaxies (see for instance the atlas in Lero v et al.l 
((2008)). Nevertheless when the 3" fibre samples regions with 
diameter in excess of 5 kpc, we approach the same scaling 
law found by the integrated H i measurements but we always 
fah below the locus of ALFALFA and GASS detections. This 
is not unreasonable both because H i-detected galaxies form 
a gas-rich subset of all galaxies, and because gas disks gen- 
erally are more extended than the stellar disks of galaxies. 
We do, however, fall above the locus predicted by the pho- 
tometric estimator at high mass, indicating that the sample 
for which we can estimate gas masses is a biased sample and 
we will return to this topic in section [631 below. 

While this is an indirect test of our method, it is an 
important consistency check and taken together with the 
results for NGC 628 we argue that it is consistent with the 
assumption that our method can give reliable estimates of 
Egas within the optical disk of a galaxy. Taken together with 
the results of the previous section we conclude that the tech- 
nique works well from ^ 100 pc scales up to ^ 10 kpc scales. 
Our method will, however, not work outside the optical disk 
of a galaxy because it relies on a background source that the 
attenuation can be measured against. 

We have also verified, but do not show, that we can 
recover the scaling law between stellar surface mass den- 



sit y and the s t ar f ormation efficiency (SFR/Mgas) found 
bv I Lerov et al.l ((2008) using our method. 

In theory we could make the preceding comparison 
much more direct by exploiting the large samples of galax- 
ies with integrated H l masses. In practice, however, a direct 
comparison is problematic because of the very large differ- 
ence in apertures between the 3" aperture of the SDSS spec- 
trum and the 3.5' Arecibo beam. We have carried out this 
test and the details are discussed in Appendix [A] but al- 
though the results are encouraging, and consistent with the 
conclusions of the preceding tests, the strength of this test 
is limited by the significant systematic uncertainties on the 
aperture corrections. One might wonder if we could improve 
on this by focusing on a comparison for the more distant 
objects where the SDSS fibre subtends a considerably larger 
spatial size. However at those redshifts we hardly have any 
H I data to compare to so at present this is not a practical 
approach to take. 



5.5 Comparison to other techniques 

In closing this section, it is useful to compare our method 
to a selection of indirect gas predictors in the literature. 
These are generally calibrated on a sample of nearby galax- 
ies and applied to much larger samples. Early work on this 
made use of the fact that there is a good correlation be- 
tween galaxy colour and the ratio of Hi mass to B-band 

T 1 I 1 I 1 I ' 

luminosity (Roberts 1969; Bothun 1984). More recently, 
Kannappan (2004, K04) found a tight relationship between 
galaxy colour, k-corrected to 2; = 0, and Thi = Mhi/M*: 



logrHi = 1.46- im{u-r), 



(30) 



and a somewhat tighter relation with u — K which we will 
not use here. She showed that the scatter in this relation was 
only 0.42 dex and could therefore be used to give reasonable 
estimates of a galaxy's gas content on the basis of its colour. 
An improved version of this was presented by Z09 who 
showed that an even tighter relationship could be had by 
including the i-band surface brightness, in: 

logrni = -1.73238(^ - r) + 0.215182//^ - 4.08451, (31) 

with a scatter of only 0.31 dex. A similar scatter was also 
found for mass i ve g alaxies based on the GASS survey by 
ICatinella et al.l (|20ip), incorporating instead the near-UV to 
r colour and stellar mass surface density, /i*. For simplicity 
we will use the Z09 relationship in equation (|31|) , as this will 
suffice for our needs here. 

We calculated these gas mass estimates for the 
SDSS and NGC 628, when necessary calculating k- 
corrected colours using v ersion 4.1.4 of the kcorrect code 
([Blanton k, Roweisll2007h . In contrast to Z09 we also use the 
stellar mass estimates from the kcorrect code. For NGC 628 
we obtained photometry in the 6", 12" and 24" apertures 
on the basis of SDSS imaging of the object. 

An alternative t echnique, widely used at high redshift 
(e.g. lErb et al.l l2006|l , is to inve r t the Kennicutt -Schmidt 
(K-S) scaling law ('Schmidt"l959'; 'Ke nnicuttl ll998') between 
the surface mass density of gas and star formation rate per 
unit area: 



SsF 



2.5 X 10 ^gas5 



(32) 
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Figure 16. Each panel compares the scahng law between stellar mass and rgas in for star forming galaxies in the SDSS (solid line with la 
scatter shown as s haded grey region) to the scaling law between Mhi and stellar mass found using the ALFALFA survey (blue points), 
the GASS survey (|Gatinella et al.|[201Cll ) (red points) and the rgas values predicted using the Z09 photometric gas content estimator 
(purple solid line). The different panels correspond to different redshifts, here indicated by the size subtended by the 3" fibre used by 
the SDSS spectrograph. At the lowest redshifts the gas fractions are clearly below the mean gas fractions from the H I surveys because 
we sample the central parts of galaxies which are known to be less gas dominated, but as we sample more than ~ 5 kpc the trends found 
approach each other. 



where Hsfr is in units of M© yr~^ kpc~^ and Sgas is in units 
of M0 pc~^, and where uncertainties on the parameters are 
suppressed. We have written the equation here for a Salpeter 
IMF. To follow common usage in the literature we derive 
the SFR from the Ha-luminosity and correcting for dust 
attenuation using Ha/H/3, assuming a fixed Case B value 
of 2.86 and t(A) oc A~^"^, but we have checked that we get 
similar results if we use the more sophisticated modelling of 
B04. 



Fig. [T71 shows the comparison of these three gas esti- 
mators together with our spectroscopic method against the 
total gas content for the regions in nearby galaxies and NGC 
628. We have excluded the three regions marked with a red 
square in Fig. [13] for clarity as these are outliers for all the 
estimators. The dashed line in each panel is a 1-1 relation, 
while the dotted line is the best unweighted bisector fit to 
the data. For the K-S scaling relation and our method we 
also indicate the median perpendicular distance to the 1-1 



line, mp, as well as the perpendicular scatter around this 
line, Gp. 

We can immediately see that the K04 and Z09 estima- 
tors correlate poorly with the local gas content. This is not 
unexpected because they were calibrated on the integrated 
gas content of galaxies and they should not be expected to 
work for resolved properties. While a slightly unfair compar- 
ison, it does highlight very clearly the problem of applying 
a relation calibrated on local data outside of its range of 
validity — a similar caution would be valid for an applica- 
tion at high redshift as well. For statistical studies of total 
gas content in low redshift galaxies, they are however well 
suited. 

The Kennicutt-Schmidt relation, in contrast, is sensi- 
tive to local properties and it clearly does a much better job 
than the colour-based techniques in Fig. [171 Nevertheless, 
the scatter around the 1-1 line is 85 per cent larger then 
with our method and the correlation has a slope different 
from one. This is not surprising as it is known that the K- 
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Figure 17. A comparison of four different gas mass estimators 
plotted against the total (H l + molecular) gas mass using a metal- 
licity dependent Xqq on the x-axis. The top left panel shows our 
spectroscopic gas mass estimates, the comparison in the top right 
panel uses the th i - (u — r) calibration from K04, the bottom 
left the calibration from Z09 and the bottom right shows the 
results o f inverting the Kennicutt-Schmidt (K-S) relation from 
iKennicuttI Jl998). The dashed line is the 1-1 line, while the dot- 
ted line shows the best linear bisector fit to the data shown. Our 
estimator clearly does best, while the K-S relation does well but 
has more scatter and a slope differing somewhat from 1. 
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Figure 18. Top panel: The median logrgas as function of radius 
for four ranges in stellar mass. Note that in each range we see 
a similar trend towards higher rgas when we sample larger radii 
but that the typical rgas increases with decreasing stellar mass. 
Bottom panel: The median log rgas as a function of the size of the 
fibre in units of the half-light radius of the galaxy. Note the lack 
of correlation between the normalised radius and the gas ratio. 



S relation shows signific ant variations (e.g. iKennicutt et al.l 
12007 : Bigiel et al." 2008). But the main problems with this 
method are not seen in this figure: a) It is calibrated on 
nearby galaxies so it is not a given that it would work for all 
galaxies and at all redshifts and b) it is a transformation of 
the star formation rate so it would be meaningless to plot it 
against a star formation rate indicator. 



6 THE CENTRAL GAS CONTENT OF LOCAL 
GALAXIES 

As an example application of the methodology outlined 
above we now turn to look at how the central gas content in 
star-forming galaxies in the local Universe varies with the 
physical properties of the galaxies. We will concentrate on 
galaxies from the SDSS DR7 for which we have fibre spectra. 

Since the fibre spectra subtend a fixed size in observer 
space we are faced with the fact that we sample different 
parts of the galaxies at different redshifts. As discussed in 
appendix [A] aperture corrections suffer from substantial sys- 
tematic uncertainties thus we prefer here to focus on galax- 
ies with for which the aperture samples similar scales. Given 
the quantities we have available this could imply focussing 
either on a fixed physical scale or on a fixed fraction of the 
galaxy size. The former ensures that the galaxies considered 
all lie in a narrow redshift range, while the latter option 
mixes galaxies across a range of redshifts but might provide 
a more physically meaningful result. 

To illustrate the variation in properties with these two 
different definitions of central regions as well as to explore 
the impact of different aperture sizes, we calculate rgas in 
bins of stellar mass as a function of aperture size in physical 
units, as well as in bins of the size of the aperture relative to 
the half-light radius of the galaxy. The results are shown in 



Fig. \T8\ and show that rgas is effectively independent of the 
aperture size in units of the half-light radius of the galaxy 
out to ~ 2 X i?5o. If we contrast this with Fig. IA2I we see 
some differences which in part is due to that figure using H I 
only, adding H2 will flatten the central profiles, but it is also 
likely that part of this flat structure is a sign of the biased 
nature of our sample relative to the total galaxy population 
and we will return to this below. 

Since both these definitions of "central" are defendable, 
we will show results for both in the following. Specifically 
we will look at gas content in the central 2 zb 0.5 kpc, corre- 
sponding to 0.05 < z < 0.09, when we fix the physical scale. 
For simplicity we do not attempt to correct for aperture 
effects within this redshift interval, but based on Fig. [TSl 
these would be itO.ldex in median. We also will show re- 
sults when the sample is limited to 0.7 < r^ < 1.4, with 
rn = R/R50. The samples resulting from these two cuts 
both include ^ 7 x 10^ galaxies. These definitions should be 
considered a central, but not nuclear, gas content of star- 
forming galaxies in the nearby Universe. The qualitative re- 
sults in the following section are only weakly sensitive to this 
choice of aperture but the quantitative results do of course 
change. The ultimate consequence of this is that the results 
we derive below are not representative for the total gas con- 
tent of galaxies, for which suitable studies are available in 
the literature (e.g. Catinella et al.ll2012l : ISchiminovich et al.l 
i2010> : iiSaintonge et al. H2011ai ). 



6.1 The central gas-to-stellar ratio in galaxies in 
the nearby Universe 

We show our inferred variation in rgas in the central regions 
of galaxies in Fig. 1191 The details of the construction of 
these maps are given in appendix [C] but they are basically 
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Figure 19. The gas content of star-forming galaxies within 2 ± 0.5 kpc (left column) or within 0.7 < rn < 1-4 (right column) in the 
SDSS as a function of various physical parameters. Each quantity is shown with the average logrgas in the left panel with blue to red 
colour scale, while the panel of the right shows the mean redshift in the colour scale (green to blue) and overlaid a contour map of the 
fraction of all SDSS galaxies that we have gas measurements for (see text for details). The top row shows the trend with stellar mass and 
stellar surface mass density, while the following rows show rgas in the log M* versus rest-frame g — r colour plane, the log M>K-oxygen 
abundance plane, the log M* versus specific SFR plane, the BPT line ratio diagram and finally, the D4000Ar versus ii6A plane. Contour 
lines are overlaid in steps of 0.5 dex from logrgas = —2. In the mean redshift panel, the completeness contours are not shown for the 
BPT plane or the logM*-oxygen abundance plane since essentially all galaxies that we can plot in these planes have gas mass estimates. 
Notice the very regular trend with various parameters but note in particular that the trends are not a function of stellar mass only, or 
indeed any other single parameter considered. The dashed green line in each panel shows the median value of the quantity plotted on 
the y-axis as a function of the parameter on the x-axis. 
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2D histograms of the average rgas combined with a kernel 
estimator where there are few galaxies. The figure shows 
two main columns, on the left we show the results when 
limiting ourselves to a small range in physical aperture size, 
while the right hand column shows the same for a limited 
range in aperture size relative to half-light radius. In each 
row we have 2 + 2 panels with the left panel in each pair 
showing the average log rgas in each bin, while the right panel 
shows the mean redshift in each bin as a colour-scale and the 
completeness in each bin as a contour plot. We will discuss 
these further below but focus on the mean rgas maps first. 
The top row shows the variation in rgas in the stel- 
lar mass versus stellar mass surface density plane. The top 
colour scale gives the median log rgas at a given position and 
for ease of reading we have also overlain contour lines that 
are spaced 0.5 dex apart starting at log rgas = — 2. What is 
particularly noticeable here is that the contour lines are not 
parallel to either coordinate axis which implies that neither 
M* nor H* alone are sufficient to predict the central gas 
content of these galaxies. This was also pointed out by Z09 
and was one of their reasons for including a surface bright- 
ness term in their calibration for rgas versus colour, but note 
that their conclusion was relevant for the total gas content, 
here we show that a similar result is appropriate also for the 
central regions of galaxies. Also note that while the slope of 
the contour lines differ between the cut in physical size and 
in rn, as one would expect from Fig. [181 the basic conclusion 
is still the same. 

The second row shows the mean rgas in the plane of 
stellar mass versus rest-frame total g — r colour. We see that 
galaxies with a particular rgas can have a range of colours 
but here there is a qualitative difference between a limit 
in physical size and in rn- When we limit ourselves to the 
central 2 ±0.5 kpc, the gas content at fixed stellar mass does 
not vary much with integrated colour. In contrast, when we 
limit ourselves to a range in rn, we find a clear correlation 
so that redder galaxies have less central gas at a fixed stellar 
mass. 

The mass-metallicity plane is shown in the following 
row. This was examined also by Z09 and we find a similar 
diagram to those authors. What is particularly notable here 
is that there is a significant trend in r gas across the mean 
M* - Z relation (JTremonti et al.|[20o3 v Also note the con- 
stant rgas, particularly at 12 + logO/H > 8.6 and in the 
right column. This is consistent with the expectations of a 
closed-box model, but we note that this does not appear to 
hold everywhere in the M* — Z plane and indeed a full ex- 
ploration of this result is outside the scope of the present 
paper. 

The subsequent row shows the plane of stellar mass 
versus specific star formation rate, sSFR = SFR/M*. We 
see clearly that the gas fractions of galaxies with the same 
SFR/M* decline steadily with mass but that there is also a 
slight trend for galaxies at fixed M* to have declining gas 
fractions as the specific SFR declines, although the decline 
is not very strong. This is more visible when we focus on a 
range in rn- 

The penultimate row shows the gas fraction distribu- 
tions as a function of position in the BPT diagram. The 
main point to note here is that the trend is smooth with po- 
sition in the BPT diagram but with some trend across the 



main star forming ridge (c.f. iBrinchmann et al.ll2008l l and 
decreases towards the composite/ AGN branch. 

The final row shows how the gas fraction change with 
position in the D4000Ar versus USa diagram. Recall that 
galaxies with high USa have undergone a significant burst 
in the last ^ 1 Gyr, while the average age of the stellar pop- 
ulation typically increases with increasing D4000Ar . The first 
point to note that along the median line the gas fractions 
decline steadily towards larger DAOOOn ■ This median line is 
close to that expected for a slow declining star formation 
rate (e.g. Kauffman n et al.ll2003bl ). while above it we expect 
to find post-burst systems. 

While these 2D distributions really capture the inter- 
dependencies of various parameters, it is useful to simplify 
these into scaling relations. Focusing on the relationship 
with stellar mass and stellar surface mass density, we get 



logrg 



8.76 ± 0.03) - (0.474 zb 0.004) log M* (33) 
-(0.619 ±0.004) logs* (34) 



with a la scatter of 0.27 dex. If one only includes the stellar 
mass in the fitting, the best-fit relation is 



log rgas = (8.17 ± 0.03) - (0.925 ± 0.003) log M* 



(35) 



which has a la scatter of 0.33 dex. Qualitatively these 
relati onships are sim ilar to that f ound for total Hi (e.g . 
Catin ella et"alll2012l ) and CO fe.g. ISaintonge et aLllibllal i. 
but since the trends found here are for the central regions, 
they differ quantitatively, primarily in amplitude. 



6.2 The gas depletion time of galaxies in the 
nearby Universe 

Besides the fact that our method provides a measure of 
the gas content of the galaxies, one key advantage of our 
methodology over estimating gas content using the Schmidt- 
Kennicutt relation is that we can compare our gas content 
to the star formation rate. Since we have star formation es- 
timates for exactly the same regions we have gas estimates 
for, the calculation of the gas depletion time. 



tR 



M^; 



SFR' 



(36) 



is straightforward in principle. In practice we rewrite equa- 
tion (|36|) to give 



tR 



Area _ Area 

?7Ha2.gas = -J JtR, 



(37) 



where Area is the area subtended by the 3" fibre in square 
parsecs and rj^^ is the conversion factor between Ha lu- 
minosity and the SFR rfCharlot fc Longhettil [JOOll l defined 
through SFR = Lna/rjna- We then calculate the likelihood 
distribution for ft^^ and insert this in equation ()37|) to cal- 
culate a PDF for tR. This accounts for intrinsic correlations 
between the parameters and we use this in the following. 
None of the conclusions below would change significantly if 
we were to use point estimates of the SFR and Sgas and as- 
sume they were independent but the uncertainties would be 
more difficult to estimate and there is a s light overall bias in 
tR — this is expected, see appendix A in IBrinchmann et al.l 
(|2004r ) for a discussion. 

Fig. [20] is similar to Fig. [19] and shows how the me- 
dian tR varies across the same parameter spaces. Comparing 
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Figure 20. Similar to Fig. llQI but this time showing the median gas depletion time in each bin. 



these two figures it is immediately obvious that long deple- 
tion times do not directly follow high gas fractions. Another 
immediate result is that galaxies that have very long deple- 
tion times are located in very particular parts of most dia- 
grams. They have low S*, are red for their mass, lie well be- 
low the median mass-metallicity relation but also have very 
low SFR/M* for their mass as well as a low [O iii]5007/H/3. 
We note in passing that a similar class of objects can be 
identified if one uses the Z09 calibration for gas mass esti- 
mation. 



The D4000Ar-H5A diagram is particularly intriguing — 
it shows that along the median trend, shown by the green 
dashed line, the galaxies have near constant depletion time 
of ^ 10^ years in their central regions. However Fig.llQIshows 
that the gas fractions decrease steadily moving along the 
median curve from low to high D4000Ar . This strongly hints 
at a regulatory mechanism for the central star formation 
rate, similar to that proposed to explain the same result for 
the total depletion times of galaxies by ISchiminovich et al.l 
(2010 ), see also the following section. 
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The median (average) depletion time we are finding for 
tfie central 2 kpc is 2.2 (3.6) Gyr and a similar result if were 
to limit ourselves in rn instead. We note that this is very 
close to the mean mol ecular deplet i on tini e, 2.35 Gyr, found 
on '-^ Ikpc scales by iBigiel et al.l (|201lh in nearby spiral 
galaxies with CO and Hi maps. Since the inner 2 kpc is 
likely to have a very significant molecular component, these 
numbers should be comparable . 

In contrast to Bigiel et al., however, we find a signifi- 
cant variation in depletion time with galaxy properties in the 
central regions. Instead our results match closely the strong 
variation in molecular depletion time with galaxy proper- 
ties found bv ISaintonge et al.l (|2011bl l in galaxies from the 
COLDGASS survey and we will return to this in a future 
paper. 

The distribution of tn values in the log M*-sSFR dia- 
gram for the inner 2 zb 0.5 kpc is well fit by a simple linear 
function 

log tR = 8.99 ± 0.37 - (0.969 =b 0.003) log M* (38) 

- (0.948 ± 0.004) log sSFR (39) 

with a scatter of only 0.31 dex. In this equation, and in 
Figures [T9l and [20l the stellar mass is the total stellar mass, 
while the specific star formation rate is calculated for the 
region sampled by the fibre, as is the depletion time. If the 
stellar mass is measured within the fibre, the coefficients 
would change to (10.06, —1.04, —0.85) and if both the stellar 
mass and the specific SFR are the galaxy integrated values, 
the coefficients would be (12.38, —1.10, 0.73). It is important 
to emphasise that the gas content in these equations are for 
the central 2 kpc, and not for the galaxy as a whole; these are 
rather different quantities and we do not attempt to correct 
tn for aperture effects. 



6.3 Sample completeness and selection biases 

In the preceding section we have given an overview of the 
gas content and gas depletion times in the central regions of 
star- forming galaxies from the SDSS. However we are unable 
to measure the gas content of all galaxies and it is reasonable 
to ask to what extent our results are representative for the 
galaxy population and not significantly biased. 

The main bias is that for a large number of galaxies we 
do not have sufficient emission lines detected in the spectra 
to apply our technique. There is also an additional biases 
for some galaxies with emission lines where the ionisation 
source is not likely to be stellar. In this case the modelling 
outlined in section [3] is not applicable. Thus there is a set 
of galaxies which likely have gas, but for which we cannot 
measure Sgas using our method. 

To illustrate the effect of the former bias, the second 
panel in each pair of panels in Figures [19] and [20] shows a 
contour map of the fraction of galaxies for which we could 
determine Sgas with our method. This shows more or less 
the expected distribution: For blue and actively star form- 
ing galaxies we have Sgas estimates for most galaxies, while 
for red, massive and low sSFR galaxies we only have Egas 
measurements for a very small subset. 

A reasonable way to think about this is that where we 
have Sgas for more than 50 per cent of the galaxies our 
mean rgas estimates likely to be fairly representative for the 



galaxy population as a whole. This level is indicated by a 
thicker black contour line in the plot. At lower completeness 
levels we do not know exactly how representative our mea- 
surements are, but as the main cause of incompleteness is a 
lack of star formation those missed galaxies most likely have 
lower gas content than those included in Figures [19] and 1201 
In this case rgas will be an upper limit to the true mean, but 
it is not immediately possible to say what the implications 
are for the depletion time. 

A complementary method to test for the effect of in- 
completeness is to use a different technique to estimate the 
gas content and gas depletion time. We have two such meth- 
ods available to us: gas content measurements from the AL- 
FALFA survey and gas estimates from fitting formulae, in 
particular here the photometric method introduced by Z09. 
Unfortunately we can only use these methods to get total 
gas content so we cannot do a direct comparison here, but 
they are still very useful as a sanity check on the results. 

We focus on two distributions, and in Fig. [21] we show 
the variation of rgas (top row) and tn (bottom row) in the 
logM* versus logS* plane for our method in the two left- 
most panels in each row. These are the same displayed in 
Figures [T9] and [20] and are therefore estimates of central gas 
properties. These can be contrasted to the total estimates 
using the Z09 estimator in the third panel of each row and 
ALFALFA in the final panel in each row. 

The first point to note is that the rgas distributions are 
qualitatively similar in each panel. In particular we would 
expect that a 2 kpc radius (leftmost panel) for the lowest 
mass galaxies would approach the total gas content and in- 
deed we see a fairly close correspondence to the total esti- 
mates using the Z09 estimator. We also note that the AL- 
FALFA sample is biased to more gas-rich galaxies overall so 
differs significantly from the rest. 

The other striking result is that the distribution of tn 
is substantially different for the total estimator and the cen- 
tral. Again at low mass we can see a correspondence to the 
Z09 estimator and possibly also the ALFALFA measure- 
ment, but note that the colour scales are quite different. 

A slightly different picture is offered by the D4000iv 
versus H(5a plane shown in Fig. 1221 The run of rgas is again 
qualitatively similar in each case. However in this case the 
tn distribution is also comparable and in particular we note 
that the trend for higher depletion times at both low and 
high 116a at fixed D4000iv that we noted above, can also be 
seen, albeit at a lower S/N. 

This is a somewhat surprising result. The normal in- 
terpretation of strong Balmer absorption, here a high US a, 
is that it indicates a recent cessation of or strong drop in 
star formation activity (e.g. Wild et al.ll200 ^V What we are 
finding is that in these galaxies rgas is elevated (top row), 
particularly on a 2 kpc scale and that this increase is not 
fully taken out by the increase in star formation, thus tn in- 
creases. What Fig. [22] implies is that many of these systems 
still retain significant amounts of gas but that as the star 
formation has dropped tn is high. 

The population falling below the median line is less 
studied, but when matched in D4000iv and stellar mass to 
other star forming galaxies they are seen to have a signif- 
icantly, '-^ 0.7 dex, lower star formation rate than galaxies 
on the median line. They are then concieveably rejuvenated 
post-starburst galaxies. 
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Figure 21. The variation of rgas (top row) and Ir (bottom row) in the logM* versus logE* plane. The two left-most panels on each row 
are the same as shown in Figures fTol and [20] The third panel shows the mean log rgas and \ogtR in this plane using the Z09 photometric 
gas estimator, while the final panel shows the same using total gas masses from the ALFALFA a. 40 data release. Thus the two right-most 
panels show galaxy integrated quantities. Note that the behaviour for Ir is clearly different for the total versus central measurements, 
while the rgas trends are qualitatively similar, but the quantitative levels are somewhat different as the colour bars above shows 



These results are qualitatively consistent with a picture 
where most galaxies have a steady decline in gas content as 
they evolve and where inflow of gas leads to a star burst 
episode. In some cases when this ends, the star formation is 
quenched but all the gas is not removed from the system. 
The spread in Ir on the mean D4000Ar-H^A track is quite 
substantial so the eventual fate of these systems cannot be 
well constrained from just this plot but this might offer an 
interesting constraint on the gas cycle in galaxies. 



7 CONCLUSIONS 

In the preceding we have presented a method for obtain- 
ing dust-to-metal and dust-to-gas ratios, as well as gas sur- 
face mass densities in galaxies using optical spectra. We 
showed that the dust-to-gas ratios provided by our method 
give consistent results to those obtained from modelling of 
the dust emission although a detailed one-to-one compari- 
son was impossible with current data. We also showed that 
our method provides gas content estimates that agree with 
measurements of H2 + Hi to within a factor of ~ 2. This 
method requires emission lines that constrain the dust col- 
umn density and lines of elements that are both (mostly) 
unaffected by depletion onto dust grains as well as elements 
that are depleted onto dust grains and that are important 



coolants in the interstellar medium. In practice this means 
that most strong optical emission lines from [Oil] 3727 to 
[S ii]6716,6731 should be measured. 

The method provides total gas column densities unaf- 
fected by the substantial uncertainty on Xco, and because 
optical spectroscopy can provide much higher spatial resolu- 
tion than most interferometric studies can, while at the same 
time providing star formation rates in the same region, this 
offers potentially a great complement to CO and H i studies 
of nearby galaxies. It would for instance be possible to com- 
pare our gas masses to H 1+H2 masses to infer the CO-to-H2 
conversion factor for large samples, si milar to what wa s done 
for a sample of nearby galaxies by iLerov et al.l (|201lh . 

At the same time, rest-frame optical spectroscopy is 
fairly easy to carry out for star-forming galaxies over a wide 
range in redshift. As long as the spectra can be properly 
flux- calibrated, this opens up the possibility to trace the 
central gas content of galaxies over a wide range in redshift. 

However we saw that there are a few limitations of the 
method: At metallicities below 50 per cent solar, our con- 
straints on the dust-to-metal ratio are very weak and we 
need to apply a prior on this quantity. Overall this appears 
not to be a major issue as the effect of applying the prior 
is small and in comparison to the uncertainty in going from 
CO to H2 mass estimates, it is negligible. 

The second issue is that at high gas column densities. 
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Figure 22. The same as Fig. [21] but for the D4000Ar versus US a plane. Note that there are clear similarities between the total and 
central distributions here, in particular a high depletion time both for high and low USa at fixed D4000iv. 



Egas/M© pc^ > 75-100, the method breaks down. This is 
simply because the emission lines must probe through the 
gas-rich region in order to provide information on it, but at 
very high column densities the regions become opaque to op- 
tical radiation. This is a fundamental limitation of this tech- 
nique and does limit the usefulness of the technique when 
studying the very central regions of nearby galaxies. It is 
also likely that it will limit its usefulness to some classes 
of more distant galaxies although any firm conclusions on 
the latter must await a more careful assessment in future 
works. The method is also likely to break down at very low 
surface mass densities but we have not found this limit with 
our tests. From our tests we are confident that the method 
works well in the regime 10 < Sgas/M© pc~^ < 50-70, 
but the Egas estimates are well behaved at least down to 
Egas "^ 2Mq pc~^ below which only 1 per cent of the SDSS 
DR7 galaxies fall, but we are unable to test this directly. 

The final issue is that our method can only provide in- 
formation on gas in regions where there is star formation 
going on. This is a key reason why this method cannot be a 
replacement for CO and H i measurements of galaxies. How- 
ever this is offset by the fact that, particularly at higher red- 
shifts, the method provides a much higher spatial resolution 
than Hi and CO observations can provide, and it can ac- 
count for atomic gas whose detection at significant redshifts 
is infeasible with current facilities, and thus complements 
the methods very nicely. 

We demonstrated the usefulness of this method by ap- 
plying it to the star-forming galaxies in the SDSS and pre- 



sented a summary of the variation in gas content and gas 
depletion times in the central 2 ± 0.5 kpc of these galax- 
ies. These trends in general follow the trends for the overall 
gas content found in recent H i and CO surveys of nearby 
galaxies, so that the gas content of galaxies decreases with 
stellar mass and stellar mass surface density. However the 
large sample size has made it possible to highlight the com- 
plex variation of gas content and gas depletion time with 
physical parameters. In particular we found that galaxies 
with high and low Balmer absorption at a fixed D4000aa 
appear to have high gas depletion times both in the central 
regions as well as in integrated quantities, possibly refiecting 
a feedback cycle associated with star bursting galaxies. 
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APPENDIX A: APERTURE CORRECTIONS 

A key goal for our aperture correction scheme is to allow a 
direct comparison of spectroscopic gas mass measurements 
using the 3" SDSS fibre and Hi mass estimates obtained 
with Arecibo, whose typical beam size is 3.5' full- width half- 
power. Since the SDSS spectra rarely sample even 30 per 
cent of the total light of the galaxies and it is well-known 
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that the H i disks often extend significantly outside the stel- 
lar disk, a direct comparison without taking into account 
aperture effects would be very misleading. 

In order to compare the total Hi gas masses to those 
derived from optical spectroscopy we have to account for 
differences in aperture. To carry out these aperture correc- 
tions we need H i and optical radial profiles for a wide range 
of galaxies. We take these from two samples observed us- 
ing the Westerbork ar ray: the Urs a Major sample (UMa) of 
Verheijen, Tully et al (JTullv et al.| [l996: Verheiien & Sanc^ 
2OOII), and the WHISP survey JN oorder meer et al.1 12001 . 



20071 : iNoordermeer fc van der Hulst„ 2007) . 



For the UMa surve y we take surface photometry data 
from ITuIIv et al.l (| 19961 ) who presented results for 5, i?, 
/ and K. We mat c h this data to the Hi profiles from 
IVerheiien k, Sancisil (|200l|), kindly supplied in electronic 
form by M. A. W. Verheijen. For the WHISP survey we take 
the H I profile fits reported bv INoordermeer et al.l (|2005l l and 
complement these wi th B, R k I p rofiles derived from sur- 
face photometry by INoordermeer & van der Hulst (2007). 
For both surveys we ignore uncertainties in the fits as other 
uncertainties dominate. 

Given the 2D distributions of H i and light we can cal- 
culate the gas-to-stellar mass ratio, rgas: 



<R) 



Sgas(r <i^) 

/i*(r<i?) ' 



the gas fraction, fg. 

/gas(r <i?) 



(Al) 



(A2) 



/i*(r <i?) + Sgas(r <i?)' 

and the surface gas mass density: Sgas(r < R). Here we 
indicate explicitly that these values are integrated values 
inside a radius, R. 

This allows us to calculate aperture effects for these 
quantities as a function of radius. Note that /gas and rgas 
are of course closely related since 

/ga. = -^, (A3) 

J- ~r Tgas 

but we will treat them as two separate quantities as /gas 
depends on the absolute value of the stellar mass-to-light 
ratio, M*/L, within the aperture, while rgas only depends 
on the relative value between the small and large aperture. 

These quantities have quite different radial dependence, 
as illustrated in Fig. lAll As this figure shows (lower right 
panel), the behaviour of Sgas might be far from monotonic 
which means that this quantity is not suited for aperture 
corrections. Fig. lAll also shows a series of dotted vertical 
lines. These show the radius sampled by the SDSS fibre at 
redshifts 0.02, 0.05, 0.1, 0.2, 0.5 and 2.0, respectively. Since 
the median redshift of the SDSS is close to 0.1, we can clearly 
see that aperture corrections will be substantial regardless 
of indicator chosen. 

It turns out that both rgas and /gas are suitable for aper- 
ture correction, but as rgas does not depend on the absolute 
value of the M^/L ratio we choose to focus on this here and 
we write 



rgas (i?) /rgas (i?t( 



(A4) 



where R is the radius you wish to aperture correct to, and 
rgas{Rtot) is the radius corresponding to the total rgas- 

In the following we will derive a method to estimate Ar 
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Figure Al. An illustration of the relevant profiles for aperture 
correction. The galaxy is NGC 797 from the WHISP sample and 
note that 'gas' here refers to H I only. The top left panel shows 
the enclosed stellar mass as a function of radius, using a fixed 
M^/Lr = 1.78. The dotted vertical lines which are the same in 
all panels, indicate the radius corresponding to the radius of the 
SDSS 3" fibres at z = 0.02,0.05,0.1,0.2,0.5 and 2.0. The top 
right panel shows Ar from equation (|A4|) , the bottom left the 
corresponding Af for the gas fraction, and the bottom right the 
average gas column density inside radius r, Egas(r). From the 
figure it is clear that Egasdoes not show a monotonous trend so 
cannot be (easily) aperture corrected, while both Ar and Af are 
relatively well-behaved. 



for galaxies of different Hubble types, but it is worth com- 
menting on Riot in equation ()A4|) . This cannot be related 
to any quantity in the observed plane (such as the Arecibo) 
beam as that would give different aperture corrections for 
the same galaxy at different redshifts, so it has to be related 
to the physical properties of the galaxy. We will take it to be 
related to the radii containing 50 per cent and 90 per cent of 
the light below, but this is a significant source of systematic 
uncertainty which we will see below. 

We will also ignore M^/L variations with radius, as well 
as the molecular contribution to the total gas mass here. 
This is a reasonable approximation in view of the systematic 
uncertainty just mentioned, but also because their effects 
are opposite, including molecular gas typically decreases Ar, 
while including M^/L variations typically increases Ar. 

However Ar shows a significant dependence on galaxy 
morphology. This is not unexpected as the gas content of 
galaxies has long been kno wn to show a significant varia- 
tion with Hubble type (e.g. Roberts k Havneslll994l V This 
also extends to the distribution of gas in galaxies, and in 
Fig. IA2I which shows all aperture correction profiles, Ar for 
the combined WHISP and UMa sample. 

The thick solid lines show for clarity the average profiles 
in four bins in T-type, which are from the top down, [—1, 1), 
[0, 2), [2, 5) and [6, 10). As is clear from this figure there is a 
very noticeable type-dependence. However it is also a very 
clear spread around the mean. To explore the dependence on 
outer radius we also extrapolate the trends shown by fitting 
polynomials to the thick solid curves in Fig. IA2I 

To take the scatter properly into account we calculate 
the likelihood of a given aperture correction at a given ra- 
dius for a given T-type, P{Ar\R/R50,T). The likelihood dis- 
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I 



R/R50 

Figure A2. The aperture correction for rgas as a function of the 
radius and morphological type (indicated by colouring) for the 
combined WHISP and UMa samples. The thick solid lines show 
the average aperture correction profiles for four bins in T-type. 



tribution is derived from the observed distribution of UMa 
and WHISP galaxies assuming an error of 0.1 in Ar{R) and 
a type uncertainty of 1 class. When we apply this to the 
SDSS galaxies we convert the concentration, R9Q/R50, to T - 
type using a linear fit to the data in Shimas aku et al.l (|200lh 
propagating the uncertainties on R90 and R50 to provide 
a likelihood distribution for T. We use this to marginalise 
P{Ar\R/R50,T) over T. 

The result of applying this methodology to the galaxies 
in the SDSS with z > 0.01 and Hi detections from either 
S05, ALFALFA, or GASS, can be seen in Fig.[A3l This figure 
compares the aperture corrected values on the x-axis with 
our spectroscopic measures on the y-axis. To construct the 
main panel we have created a 2D PDF assuming the PDF 
of the quantity on the x and y axis are independent, a very 
reasonable assumption, and then added those 2D PDFs to- 
gether. The shaded region shows the best linear fit to the 
data using the Bayesian fitting technique described in Ap- 
pendix [B] with 68 per cent and 95 per cent posterior con- 
fidence intervals indicated. In the main panel we have as- 
sumed an outer radius for the Hi equal to 4 x Rgo, where 
R90 is the radius containing 90 per cent of the r-band light. 

The smaller panels show the effect of using other outer 
radii, with the top panel using the 1.75' radius of the Arecibo 
beam, and the three panels below using the indicated mul- 
tiple of Rqq . All aperture correction schemes using an outer 
radius equal to a multiple of Rqq results in a final relation 
that has a slope close to 1 (from 0.97 in the main panel to 
1.06 for an outer radius equal to R90), but with a systematic 
variation in the zero point. In contrast, assuming an outer 
radius equal to the Arecibo beam leads to a poor aperture 
correction and a best-fit line with slope 0.76. 

We conclude from this that while the unit slope is in- 
dicative of a good agreement between the aperture corrected 
Hi content and our spectroscopic gas mass estimates, the 
systematic uncertainties in this comparison are sufficient to 
render this a rather weak test of our gas estimator. 



Map 




Bin size 


Kernel bandwidth 


gM* vs logE* 




[0.05, 0.04] 


[0.20, 0.20] 


g M* vs g — r 




[0.05, 0.02] 


[0.20, 0.10] 


gM* vs 12 + logO/H 




[0.05, 0.02] 


[0.20, 0.20] 


g M* vs log SFR/M* 




[0.05, 0.03] 


[0.20, 0.20] 


g [Nii]/Ha vs log [Oiii]/H/3 


[0.02, 0.02] 


[0.10, 0.10] 


4ooon vs as A 




[0.01, 0.17] 


[0.10, 0.40] 



Table CI. Bin sizes and kernel bandwidths used for the 2D gas 
maps in Figures [TOl- [22] Note that for ALFALFA the bin sizes is 
twice the width indicated here. 



APPENDIX B: BAYESIAN LINEAR 
REGRESSION WITH NON-GAUSSIAN ERROR 
BARS 

In Appendix [A] we need to fit a linear regression line for 
data with heteroscedastic and non- Gaussian error bars. In 
the case of heteroscedastic and Gaussian error bars, this is 
well known in the literature as fitting a linear functional 
relationship and has been used in ast r onomy for quite some 



relationshi p and has been used m ast r onomy tor quite some 
time (e.g. lAkritas fc Bershadvl Il996l : iKellvl l2007l : lAndreonI 
I2OIOI I. 



Thus we write our equations as 

X = U -\- Cx 

y = V -\- Cy = a -\- bU + noise 



(Bl) 
(B2) 
(B3) 



and where we assume that we know FDF(U-\-ex) = FDFx{x) 
and PDF(y + a) = PDF^(a + bx). We can then write the 
probability distribution for the slope and intercept as: 

P{a,b\x,y) (xP{x,y\a,b) = FDFx{x)FDFy{a + bx). (B4) 

Where we have assumed uniform priors on a and b, alterna- 
tivel y it might be b etter to use a uniform prior on the angle 
(e.g. lAndreonlbOlOi ) but the constraints are tight on b so it 
is not crucial here. 

Given the small samples required we calculate the pa- 
rameters on a grid rather than using Markov Chain Monte 
Carlo. We marginalise over U and V to get the PDFs for a 
and b. To get the confidence interval in Fig. IA3I we insert 
the PDFs for a and b in equation ()B1|) . 



APPENDIX C: THE PRACTICALITIES OF 
CREATING 2D MAPS OF GAS CONTENT 

In section [6] we show a number of 2D maps of gas content and 
depletion time in nearby galaxies. For completeness and re- 
producibility this section describes in detail how these were 
constructed. 

For each panel all objects that were classified as star- 
forming using the BPT diagram and for which we have Sgas 
estimates from the spectra and that satisfies the specific 
redshift or R/R50 cuts are identified. For each diagram of x 
versus y we created a 2D histogram for these objects with 
100 bins in each direction where we calculate the mean log 
rgas or login in each bin. The resulting bin sizes are given 
in Table ICll We identify the bins with less than N -\- S ob- 
jects and for the objects that fall in these bins we use a 2D 
kernel density estimator using a Gaussian kernel with the 
bandwidths given in Table ICli again calculating the mean 
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2xR90 




-3 -2-10 1 

Log Tg^, [H I, apcorr] 

Figure A3. The main panel shows a comparison of Mhi/M*, aperture corrected down to a 3" aperture, with our spectroscopic gas 
estimates. The grey scale shows the sum of the 2D PDF for this comparison, assuming independence. The shaded region shows the best 
fit linear fit to the data with the shading indicating 68 per cent and 95 per cent confidence intervals on the posterior predicted values (see 
Appendix [Al for details). For the main panel it was assumed that the total Hi resides within 4 times the radius containing 90 per cent of 
the light, i?9o, while the small panels on the side show the same for an outer radius of i^go, 2 x i^go, 3 x Ro,q and that the outer radius 
is identical to the Arecibo beam. The solid red line in each panel is the median fit using the Bayesian method described in Appendix [B] 
while the dashed line shows the 1-1 line for comparison. 



in each bin. We then replace all bins with less than N ob- 
jects with this kernel estimates. We use A^ = 100 and ^ = 5 
for the plots in this paper. This combination of techniques 
gives good resolution where there are many objects while 
improving the S/N in the regions with few galaxies. For dis- 
play purposes we show the maps only where the sum of the 
kernel contributions to that bin is larger than 1. We use the 
same method also for the maps using the Z09 photometric 
estimator and using the data from ALFALFA, but in the 
latter case we use 50 bins in each direction given the much 
smaller sample. 
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